WO 21105/096804 



10/591870 

IAP12 Rec'd PCT/PTO 0 7 SEP 2006 

PCT/US2004/007182 



SEQUENCE LISTING 



<1!0> Lanahan, Mike 

<120> Self -processing Plants and Plant Parts 

<130> 109346.31*7 

<140> US 60/315,281 

<H!> 2001-08-27 

<160> 112 
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<210> 1 

< 2 1 1 > «J 3 6 
<212> PKT 

<213> Artificial Sequence 
<220^ 

<223> synthetic 



<400> 1 



Met 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly Gly 


Val 


lie 


Met 


Gin 


Ala 


1 




5 










10 










15 




Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


lie 


Trp 


Trp Asp 


Thr 


lie 


Arg 




20 










25 










30 






Gin 


Lys 


lie 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


lie 


Ser 


Ala 


lie 


Trp 


He 






35 






4 0 










45 








Pro 


Pre 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 


Gly Tyr 


Ser 


Met 


Gly 


Tyr 


Asp 




50 








55 










60 










Pro 


Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


65 






70 










75 










80 


Glu 


Thr 


Arg 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 


Leu 


lie 


Asn 


Met 


lie 


Asn 


Thr 








85 










90 










95 




Ala 


His 


Ala 


Tyr 


Gly 


lie 


Lys 


Val 


lie 


Ala 


Asp 


lie 


Val 


lie 


Asn 


His 








100 






105 










110 






A r g 


Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 


Tyr 


Thr 




115 










120 










125 








Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


Asn 


Tyr 


130 








135 










140 










Leu 


Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


145 








150 










155 










160 


Giy 


Gly 


Tyr 


Pro 


Asp 


lie 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 






165 










170 










175 




Leu 


i r ; ^ 


A 1 a 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


lie 


Gly 






180 








185 










190 






I le 


Asd 


Ala 


Trp 


Ara 


Phe 


Asp 


Tyr 


Val 


Lys 


Gly 


Tyr 


Gly 


Ala 


Trp 


Val 




• 


135 








200 










205 








Val 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp 


Gl y 


Gly 


Trp 


Ala 


val 


Gly 


Glu 


Tyr 




210 






215 










220 










Trp 


Asp 


Thr 


Asn 


val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


225 








230 










235 










240 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 








245 










250 










255 




Asp- 


Asn 


Lys 


Asn 


lie 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 


Gly Gly 
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260 




















2/0 






Thr 


Val 


Va 1 
275 


Ser 
»j ^ i. 


A rn 




Pro 


Php 
rue 

c. 0 V 


Lf y b 


jhi a 


Val 


Thr 


rne 


va l 


Ala 


Asn 


His 


Asp 


Thr 


As D 


lie 


1 1 e 


T r o 


A n 




i y r 


Pro 


Ala 


I yr 


Mia 


rne 


lie 




290 




















300 








Leu 


Thr 


Tyr 


Glu 


Gl v 

* 


Gin 


Pro 


Thr 


Tie 


Php 


Tyr 


Arg 


Asp 


i yr 


<j J. U 


Glu 


305 










310 

•J -I V 










315 




•5 /-\ 
320 


Trp 

* 


Leu 


Asn 


L vs 


Asp 


Lvs 


Leu 


Lvs 


Asn 


Lp ij 

i-j Li 


He 


Trp 


T 1 p 


His 


Asp 


A 0 










325 










i "J n 








j J 0 




Leu 

^-f w 


Ala 


Gl v 


Gl v 


Se r 


Th r 




T 1 p 
lit? 


v a X 




Tyr 


Asp 




MSp 


GlU 


Met 


1 } o 
















■Jj q. 
Jl J 






350 






Phe 


Va l 


A r o 




C 1 v 

u i y 


T v r 


u i y 


^P r 


T v / c 

uy s 


Pro Gly 


Leu 


lie 


Tnr 


Tyr 






1 S ^ 




















365 






He 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 


Arg 


Trp 


Val 


Tyr 


Val 


Pro 


Lys 




3 / 0 










375 










380 






Phe 


Ala 


Gly 


Ala 


Cys 


lie 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 


Gly 


Trp 


385 










390 










395 






400 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 










405 










410 








415 




Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 








420 










425 










430 




Cys 


Gly 


Val 
435 


Gly 



























<210> 2 
<211> 1308 
<212> DMA 

<213> Artificial Sequence 

<220> 

<22 3> synthetic 
<A00> 2 

atggccaagc acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg geatgtcegg gggctactcg 180 
atgggccacg acccgtacga ctacttcgac cteggegagt actaccagaa gggcacggtg 240 
gagacgeget tegggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 
ggcatcaagg ccatcgcgga categtgate aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg teggegacta cacctggacg gacttctcca aggtcgcctc eggcaagtae 420 
accgccaact acctcgactt ccaccccaac gagctgeacg cgggcgactc cggcacgttc 480 
ggcggctacc eggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc ateggcateg acgcgtggcg cttcgactac 600 
gtcaagggct aeggggectg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgaqc actgcgacac caacgtcgac gcgctgctca actgggecta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 
atcccggcgc ccgtcgaggc cctgaagaac ggeggcaegg tggtctcccg cgacccgttc 840 
aaggccgtga ccttegtcgc caaccacgac aeggacatea tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta egagggecag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccaccagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgccccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct eggeggctgg 1200 
gtggacaagc acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtaeggcta ctccgtgtgg tcctactgcg gegtegge 1308 

<2j0> 3 
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<2l \> 800 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<^500> 3 

Met Gly His Trp Tyr Lys His Gin Arg Ala Tyr Gin Phe Thr Gly Glu 

15 10 15 

Asp Asp Phe Gly Lys Val Ala Val Val Lys Leu Pro Met Asp Leu Thr 

20 25 30 

Lys Val Gly lie lie Val Arg Leu Asn Glu Trp Gin Ala Lys Asp Val 

35 4 0 4 5 

Ala Lys Asp Arg Phe lie Glu lie Lys Asp Gly Lys Ala Glu Val Trp 

50 55 60 

lie Leu Gin Gly Val Glu Glu lie Phe Tyr Glu Lys Pro Asp Thr Ser 
65 70 75 80 

Pro Arg lie Phe Phe Ala Gin Ala Arg Ser Asn Lys Val lie Glu Ala 

35 90 95 

Phe Leu Thr Asn Pro Val Asp Thr Lys Lys Lys Glu Leu Phe Lys Val 

100 105 110 

Thr Val Asp Glv Lys Glu lie Pro Val Ser Arg Val Glu Lys Ala Asp 

115 120 125 

Pro Thr Asp lie Asp Val Thr Asn Tyr Val Arg He Val Leu Ser Glu 

130 135 140 

Ser Leu Lys Glu Glu Asp Leu Arg Lys Asp Val Glu Leu lie He Glu 
MS 150 155 160 

Gly Tyr Lys Pro Ala Arg Val lie Met Met Glu lie Leu Asp Asp Tyr 

165 170 175 

Tyr Tyr Asp Gly Glu Leu Gly Ala Val Tyr Ser Pro Glu Lys Thr He 

180 185 190 

Phe Arg Val Trp Ser Pro Val Ser Lys Trp Val Lys Val Leu Leu Phe 

195 200 205 

Lys Asn Gly Glu Asp Thr Glu Pro Tyr Gin Val Val Asn Met Glu Tyr 

210 215 220 

Lys Gly Asn Gly Val Trp Glu Ala Val Val Glu Gly Asp Leu Asp Gly 
225 230 235 240 

Val Phe Tyr Leu Tyr Gin Leu Glu Asn Tyr Gly Lys lie Arg Thr Thr 

245 250 255 

Val Asp Pro Tyr Ser Lys Ala Val Tyr Ala Asn Asn Gin Glu Ser Ala 

260 265 270 

Val Val Asn Leu Ala Arg Thr Asn Pro Glu Gly Trp Glu Asn Asp Arg 

27S 2S0 285 

Gly Pro Lvs lie Glu Gly Tvr Glu Asp Ala lie lie Tyr Glu He His 

290 295 300 

lie Ala Asp lie Thr Glv Leu Glu Asn Ser Gly Val Lys Asn Lys Gly 
305 310 315 320 

Leu Tyr Leu Gly Leu Thr Glu Glu Asn Thr Lys Gly Pro Gly Gly Val 

325 330 " ' 335 

Thr Thr Gly Leu Ser His Leu Val Glu Leu Gly Val Thr His Val His 

340 345 350 

lie Leu Pro Phe Phe Asp Phe Tyr Thr Gly Asp Glu Leu Asp Lys Asp 

355 360 365 

Phe Glu Lys Tyr Tyr Asn Trp Gly Tyr Asp Pro Tyr Leu Phe Met Val 

370 375 380 

Pro Glu Glv Arg Tyr Ser Thr Asp Pro Lys Asn Pro His Thr Arg He 
385 390 395 400 
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A r n 


Glu 

VJ *■ \* 


Va 1 


Lvs 

* 


Glu 


Met 


Va 1 

* LJ J- 


Lvs 


Ala 


Le u 


His 




His 

1 1 1 J 


Glv 

vj x y 


lie 


Gl v 

va y 










405 










4 10 










4 15 




Va 1 


lie 

A X V 


Met 


Asp 


Met 


Va 1 


Phe 

l i » v~ 


Pro 


His 


Thr 


Tvr 

i y i 


Glv 


lie 

X X c 


Glv 


Glu 


Le u 








420 










425 










430 






Ser 


A 1 a 


Phe 


Asp Gin 


Thr 


Va 1 


Pro 


Tvr 


Tvr 


Phe 


Tvr 
i y x 


Ar o 


He 


Asd 


Lvs 






435 










440 










445 








Thr 


G 1 v 


Ala 


Tyr 


Leu 




Glu 

VJ -A- Vi 


Ser 


Gl v 


Cvs 

w jr w 


Glv 

\j x y 


Asn 


Va 1 


lie 


Ala 


Ser 




4 so 










455 










4 60 










Glu 

VJ vj 


A r a 


Pro 


Me t 


Me t 


Ara 


Lvs 


Phe 


lie 


Val 


Asd 

■ * w VV 


Thr 


Va 1 

» C X 


Thr 


Tvr 
i y x 


Trn 












4 70 










475 










4 80 


Va 1 

• a X 


L y j 


Glu 

VJ X u 


1 y - 


His 

H i. V 


Tie 

IXC 




Glv 

vj x y 


Phe 


A r n 


Phe 

r lie 


A ^ n 
nop 


Gl n 

VJ X 1 1 


Met 

1 IC V 


Gl v 

vj x y 


Ten 










4 8 5 










490 










495 




lie 

IXC 


A«;r> 


f .V 
by j 


Lvs 


Thr 


Me t 


T ,pi) 
ij e 


Glu 


Va 1 

V CI X 


Glu 

VJ x U 


Ara 


Ala 


I.P 11 

UC VJ 


His 


x> y o 


He 

lie 








•J \J \J 










505 










510 

■W A v 








. x vj 


Th r 

I til. 


He 


lie 

i x c 


I .p 11 


Tvr 

i > X 


Glv 


Glu 


Pro 

L X w 


Tro 

j x y 


Glv 

vj x y 


Glv 

vj x y 


Tro 


Glv 

vj x y 


Ala 

t \ X CI 
















520 










525 

*J C ~J 








P rn 


T 1 p 

X X vT 


£ r n 
#• x vj 


Phe 

L lie; 


Glv 

vj x y 


1 . v 
ijy o 


Ser 

•J C X 


Aso 


Va 1 


Ala 


Glv 

x y 


Thr 

1 1 I X 


His 


Va 1 

VOX 


Ala 


Ala 




5 30 










535 
-j *j 










540 












r.i 1 1 




Glu 


Phe 

4 * * v^ 


A r n 
n x vj 




A 1 Pi 

r\ x a 


lie 

X X c 


A r n 


Glv 

ux y 


Ser 


Val 

vox 


Phe 


Asn 

non 


Pro 

X X w 












*j <j \j 










555 

-J ~j 










560 

J V V 




■ CS X 




Glv 


Phe 


val 


Met 


vj x y 


Glv 

u i y 


i y t 


Gl v 
yi y 


I . V ^ 
uy s 


Glu 

w x. u 


Thr 

1 1 1 X 


uy j 


Tie 

lie 










565 










570 










57 5 




It j 5 


S r n 
ni U 


Ui V 


Va 1 


Va 1 


vj x y 


OCX 


T 1 p 


A n 


Tvr 
l yx 


A«;n 


ni v 

vj x y 


I . V ^ 
uy j 


I P n 


I le 


Liy o 








S80 










58 ^ 










590 






Se r 


Phe 


Al a 


Le u 


As D 


Pro 

• X VJ 


Glu 

VJ -J- w 


W X U 


Thr 


He 

X X w 


Asn 


Tvr 

i y l 


Ala 


Ala 


Cvs 


His 

> I X w 






595 










600 

W V V 










605 








Asp 


Asn 


His 


Thr 


Leu 


Tr d 

J X fj 


As D 


Lvs 


Asn 


Tvr 


Le u 


Ala 


Ala 


Lvs 

^J Y w 


Ala 


Asd 




610 










615 










620 










Lys 


Lys 


L v s 


Glu 


Trp 


Thr 


G i u 


Glu 


Glu 


Leu 


Lvs 


Asn 


Ala 


Gin 


Lvs 


Leu 


625 










630 










635 










640 


Ala 


Gly 


Al a 


I le 


Leu 


Leu 

4J V w 


Thr 


Ser 


Gin 


Glv 


Va 1 


Pro 


Phe 

hit w 


Le u 

±J w w 


His 


Glv 

vj x y 








645 










650 










655 




Gly Gin 


A S D 


Phe 


Cys 


Ara 

4 i A- VJ 


Thr 


Thr 


Asn 


Phe 


Asn 


Asd 


Asn 

• » sJ i * 


Ser 


Tvr 

* y x 


Asn 








660 










665 










670 






Ala 


Pro 


r le 


Ser 


lie 


Asn 


Gl v 


Phe 


Asd 


Tvr 

l > x 


Glu 


Arg 


Lys 


Leu 


Gin 


Phe 






675 










630 










685 








I Le 


Asp 


Val 


Phe 


Asn 


Tvr 


His 


Lys 


Glv 


Leu 


lie 


Lys 


Leu 


Arg 


Lvs 


Glu 




690 










695 










700 










His 


Pro 


Ala 


Phe 


Arg 


Le u 


Lys 


Asn 


Ala 


Glu 


Glu 


He 


Lvs 


Lvs 


His 


Leu 


705 










710 










715 










720 


Glu 


Phe 


Leu 


Pro 


Gly 


Gly 


Arg 


Arg 


lie 


Val 


Ala 


Phe 


Met 


Leu 


Lys 


Asp 










725 










730 










735 




His 


Ala 


Glv 


Gly 


Asp 


Pro 


Trp 


Lys 


Asp 


lie 


Val 


Val 


lie 


Tyr 


Asn 


Gly 








740 










745 










750 






Asn 


Leu 


Glu 


Lys 


Thr 


Thr 


Tyr 


Lys 


Leu 


Pro 


Glu 


Gly 


Lys 


Trp 


Asn 


Val 






755 










760 










765 








Val 


Val 


Asn 


Ser 


Gin 


Lys 


Ala 


Gly 


Thr 


Glu 


Val 


lie 


Glu 


Thr 


Val 


Glu 




770 










775 










780 










Gly 


Thr 


He 


Glu 


Leu 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


Val 


Leu 


Tyr 


Arg 


Glu 



735 790 795 800 



<210> 4 

<2I1> 2400 

<212> ON A 

< 2 1 3 > Artificial Sequence 
<220> 
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* 2 2 3 > synthetic 
<400> 4 

atgggccact ggtacaagca ccagcgcgcc taccagttca ccggcgagga cgacttcggg 60 
aaggtggccg tggtgaagct cccgatggac ctcaccaagg tgggcatcat cgtgcgcctc 120 
aacgagtggc aggcgaagga cgtggccaag gaccgcttca tcgagatcaa ggacggcaag 180 
gccgaggtgt ggatactcca gggcgtggag gagatcttct acgagaagcc ggacacctcc 240 
ccqcgcatct tcttcgccca ggcccgctcc aacaaggtga tcgaggcctt cctcaccaac 300 
ccggtggaca ccaagaagaa ggagctgttc aaggtgaccg tcgacggcaa ggagatcccg 360 
gtgtcccgcg tggagaaggc cgacccgacc gacatcgacg tgaccaacta cgtgcgcatc 420 
gtgctctccg agtccctcaa ggaggaggac ctccgcaagg acgtggagct gatcatcgag 480 
ggctacaagc cggcccgcgt gatcatgatg gagatcctcg acgactacta ctacgacggc 540 
gagctggggg cggtgtactc cccggagaag accatcttcc gcgtgtggtc cccggtgtcc 600 
aagtgggtga aggtgctcct ctccaagaac ggcgaggaca ccgagccgta ccaggtggtg 660 
aacatggagt acaagggcaa cggcgtgtgg gaggccgtgg tggagggcga cctcgacggc "720 
gtgttctacc tccaccagct ggagaactac ggcaagatcc gcaccaccgt ggacccgtac 780 
tccaaggccg tgtacgccaa caaccaggag tctgcagtgg tgaacctcgc ccgcaccaac 840 
ccggagggct gggagaacga ccgcggcccg aagatcgagg gctacgagga cgccatcatc 900 
tacgagatcc acatcgccga catcaccggc ctggagaact ccggcgtgaa gaacaagggc 960 
ctctacctcg gcctcaccga ggagaacacc aaggccccgg gcggcgtgac caccggcctc 1020 
tcccacctcg tggagctgcg cgtgacccac gtgcacatcc tcccgttctt cgacttctac 1080 
accggcgacg agctggacaa ggacttcgag aagtactaca actggggcta cgacccgtac 1140 
ctcttcatgg cgccggaggg ccgctactcc accgacccga agaacccgca cacccgaatt 1200 
cgcgaggtga aggagatggt qaaggccctc cacaagcacg gcatcggcgt gatcatggac 1260 
atggtgttcc cgcacaccta cggcatcggc gagctgtccg ccttcgacca gaccgtgccg 1320 
tactacttct accgcatcga caagaccggc gcctacctca acgagtccgg ctgcggcaac 1380 
gtgatcgcct ccgagcgccc gatgatgcgc aagttcatcg tggacaccgt gacctactgg 1440 
gtgaaggagt accacatcga cggcttccgc ttcgaccaga tgggcctcat cgacaagaag 1500 
accatgctgg aggtggagcg cgccctccac aagatcgacc cgaccatcat cctctacggc 1560 
gagccgtggg gcggctgggg ggccccgatc cgcttcggca agtccgacgt ggccggcacc 1620 
cacgtggccg ccttcaacga cgagttccgc gacgccatcc gcggctccgt gttcaacccg 1680 
tccgtgaagg gcttcgtgat gggcggctac ggcaaggaga ccaagatcaa gcgcggcgtg 1740 
gtgggctcca tcaactacga cggcaagctc atcaagtcct tcgccctcga cccggaggag 1800 
accatcaact acgccgcctg ccacgacaac cacaccctct gggacaagaa ctacctcgcc 1860 
gccaaggccg acaagaagaa ggagtggacc gaggaggagc tgaagaacgc ccagaagctc 1920 
gccggcgcca tcctcctcac tagtcagggc gtgccgttcc tccacggcgg ccaggacttc 1980 
tgccgcacca ccaacttcaa cgacaactcc tacaacgccc cgatctccat caacggcttc 2040 
gactacgagc gcaagctcca gttcatcgac gtgttcaact accacaaggg cctcatcaag 2100 
ctccgcaagg agcacccggc cttccgcctc aagaacgccg aggagatcaa gaagcacctg 2160 
gagttcctcc cgggcgggcg ccgcatcgtg gccttcatgc tcaaggacca cgccggcggc 2220 
gacccgtgga aggacatcgt ggtgatctac aacggcaacc tggagaagac cacctacaag 2280 
ctcccggagg gcaagtggaa cgtggtggtg aactcccaga aggccggcac cgaggtgatc 2340 
gagaccgtgg agggcaccat cgagctggac ccgctctccg cctacgtgct ctaccgcgag 2400 



<210> 5 
<2\\> 693 
<2)2> PRT 

< 2 1 3 > Suifolobus sclfataricus 
<400> 5 

Met Glu Thr lie Lys lie Tyr Glu Asn Lys Gly Val Tyr Lys Val Val 

1 5 10 15 

lie Gly Glu Pro Phe Pro Pro Tie Glu Phe Pro Leu Glu Gin Lys He 

20 25 30 

Ser Ser Asn Lys Ser Leu Ser Glu Leu Gly Leu Thr He Val Gin Gin 

35 40 45 

Gly Asn Lys Val He Val Glu Lys Ser Leu Asp Leu Lys Glu His lie 
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50 










55 










60 










lie 


Gly 


Leu 


Gly 


Glu 


Lys 


Ala 


Phe 


Glu 


Leu 


Asp 


Arg 


Lys 


Arg 


Lys 


Arg 


65 










70 










75 










80 


Tyr 


Val 


Met 


Tyr 


Asn 


Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Asp 










85 










90 










95 




Pro 


Leu 


Tyr 


Va 1 


Ser 


lie 


Pro 


Leu 


Phe 


lie 


Ser 


val 


Lys 


Asp 


Gly 


Val 








100 










105 










110 




Ala 


Thr 


Gly 


Tyr 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


lie 


Phe 


Asp 


Val 






115 










120 










125 






Gly 


Leu 


Glu 


Glu 


Tyr 


Asp 


Lys 


Val 


lie 


Val 


Thr 


lie 


Pro 


Glu 


Asp 


Ser 




130 










135 










140 










Val 


Glu 


Phe 


Tyr 


Val 


lie 


Glu 


Gly 


Pro 


Arg 


lie 


Glu 


Asp 


Val 


Leu 


Glu 


145 










150 










155 










160 


Lys 


Tyr 


Thr 


Glu 


Leu 


Thr 


Gly 


Lys 


Pro 


Phe 


Leu 


Pro 


Pro 


Met 


Trp 


Ala 










165 










170 










175 




Fhe 


Gly 


Tyr 


Met 


lie 


Ser 


Arg 


Tyr 


Ser 


Tyr 


Tyr 


Pro 


Gin 


Asp 


Lys 


Val 








180 










185 










190 






Va 1 


Glu 


Leu 


Val 


Asp 

* 


lie 


Met 


Gin 


Lys 


Glu 


Gly 

* 


Phe 


Arg 


Val 


Ala 


Gly 






195 










200 










205 








val 


Phe 


Leu 


Asp 


lie 


His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 


Phe 


Thr 


Trp 




210 










215 










220 










His 


Pro 


Tyr 


Arg 

-* 


Phe 


Pro 


Glu 


Pro 


Lys 


Lys 


Leu 


lie 


Asp 


Glu 


Leu 


His 


225 










230 










235 










240 


Lys 


Arg 


Asn 


Val 


Lys 


Leu 


He 


Thr 


Tie 


Val 


Asp 


His 


Gly 


lie 


Arg 


Val 










245 










250 










255 




ASp 


Gin 


Asn 


Tyr 


Ser 


Pro 


Phe 


Leu 


Ser 


Gly 


Met 


Gly 


Lys 


Phe 


Cys 


Glu 








260 










265 










270 






T ) a 


Glu 


Ser 


Gly 


Glu 


Leu 


Phe 


Val 


Gly 


Lys 


Met 


Trp 


Pro 


Gly 


Thr 


Thr 






275 










280 










285 








Val 


Tyr 


Pro 


Asp 


Phe 


Phe 


Arg 


Glu 


Asp 


Thr 


Arg 


Glu 


Trp 


Trp 


Ala 


Gly 




290 










295 










300 










L- \j 


lie 


Ser 


Glu 


Trp 


Leu 


Ser 


Gin 


Gly 


Val 


Asp Gly 


lie 


Trp 


Leu 


Asp 


305 










310 










315 










320 


Met 


Asn 


Glu 


Pro 


Thr 


Asp 


Phe 


Ser 


Arg 


Ala 


lie 


Glu 


lie 


Arg 


Asp 


Val 










325 










330 










335 




Leu 


Ser 


Ser 


Leu 


Pro 


Val 


Gin 


Phe 


Arg 


Asp 


Asp 


Arg 


Leu 


Val 


Thr 


Thr 








340 










345 










350 






Phe 


Pro 


Asp 


Asn 


Val 


Val 


His 


Tyr 


Leu 


Arg 


Gly 


Lys 


Arg 


Val 


Lys 


His 






jd5 










360 










365 








Glu 


Lys 


Val 


Arg 


Asn 


Ala 


Tyr 


Pro 


Leu 


Tyr 


Glu 


Ala 


Met 


Ala 


Thr 


Phe 




370 










375 










380 










Lys 


Gly 


Phe 


Arg 


Thr 


Ser 


His 


Arg 


Asn 


Glu 


lie 


Phe 


lie 


Leu 


Ser 


Arg 


3S5 










390 










395 










400 


Ala 


Gly 


Tyr 


Ala 


Gly 


He 


Gin 


Arg 


Tyr Ala 


Phe 


lie 


Trp 


Thr 


Gly 


Asp 










405 










410 










415 




Asn 


Thr 


Pro 


Ser 


Trp 


Asp 


Asp 


Leu 


Lys 


Leu 


Gin 


Leu 


Gin 


Leu 


Val 


Leu 








420 










425 










430 






Gly 


Leu 


Ser 


lie 


Ser 


Gly 


Val 


Pro 


Phe 


Val 


Gly 


Cys 


Asp 


He 


Gly 


Gly 
















440 










445 








Phe 


Gin 


G 1 w 


Arg 


Asn 


Phe 


Ala 


Glu 


lie 


Asp 


Asn 


Ser 


Met 


Asp 


Leu 


Leu 




4 50 










455 










4 60 










Val 


L y s 


Ty r 

* 


Tvr 


Ala 


Leu 


Ala 


Leu 


Phe 


Phe 


Pro 


Phe 


Tyr 


Arg 


Ser 


His 


4 65 










4 70 










475 








480 


Lys 


Ala 


Thr 


Asp 


Glv 


lie 


Asp 


Thr 


Glu 


Pro 


Val 


Phe 


Leu 


Pro 


Asp 


Tyr 










485 










490 










495 




Tyr 


Lys 


Glu 


Lys 


Val 


Lys 


Glu 


lie 


Val 


Glu 


Leu 


Arg 


Tyr 


Lys 


Phe 


Leu 








500 










505 










510 






Pro 


Tyr 


lie 


Tyr 


Ser 


Leu 


Ala 


Leu 


Glu 


Ala 


Ser 


Glu 


Lys 


Gly 


His 


Pro 
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S 1 s 










520 


Va 1 


lie 


A r n 




I PI) 
Li \m U 


Phe 

r lie 


Tvr 


Glu 




5 30 








5^5 




A ra 




Glu 


As D 


G 1 u 


T vr 


Me t 


Val 


5-15 










550 






T 1 A 




Ser 


Lvs 


Glu 


Glu 


Ser 


Ar a 








565 

■mj W W 








'Pro 


T v r 


A s n 


T v r 




A 5 n 


Gl v 


Glu 








580 










1 \; c 

-/ " 


^p r 


Th r 

1 1 1 L 


His 


Glu 


Lpu 

U v v 


Pro 


He 






S 9 ^ 

J 7 J 










600 

U v V 


Dm 


I Oil 
1-1 w t» 


OIL! 


g i v 

vj j y 


A ^ n 


Glu 




I J e 




6 1 Pi 










615 




.-. ro 


I yr 


Asp 


ft 

as n 


ft 1 a 

nld 


Vj 1 U 


lie 


i n r 


625 










630 






Ser 


Arg 


Glu 


lie 


Tyr 


Val 


Ser 


Lvs 








645 








Val 


Ser 


Lys 


lie 


lie 


Val 


Asp 


Asp 








660 










Thr 


Met 


Gin 


Asn 


Thr 


Tvr 


Val 


Ala 






675 










680 


Lys 


lie 


Asn 


Leu 


Glu 








690 























525 








Php 


Oi II 


Ben 


Ben 

540 


no p 


Ben 


Met 


Tvr 
j yr 


G 1 v 


1 wc 


i yr 
555 


? p 1 1 


1 on 


l yr 


Ala 


P ro 

560 




Va 1 
570 


Thr 


1-1 C U 




A rn 


Gl v 
575 


1 , V ^ 


lie 


T 1 P 


fi c n 


Gl v 


j_i y o 


^ p r 


Va 1 




*J O J 










590 






i yr 


I P 1 1 


Am 


VJ X u 


fi l v 
605 


^p r 


lie 


He 


va 1 


Tyr 


b 1 y 


bl U 

620 


i n r 


O r*v 

ber 


r ne 


T vrr 

Lys 


Ser 


Ser 


Ser 
635 


Asn 


Glu 


lie 


Lys 


Phe 
640 


Leu 


Thr 
650 


lie 


Thr 


Ser 


Glu 


Lys 
655 


Pro 


Ser 


Lys 


Glu 


lie 


Gin 


Val 


Glu 


Lys 


665 










670 






Lys 


lie 


Asn 


Gin 


Lys 
685 


lie 


Arg 


Gly 



<210> 6 
<2H> 2082 
<212> DMA 

<2)3> SuJ folobus solfataricus 

<4 00> 6 

aeggagacca tcaagatcta cgagaacaag ggcgtgtaca aggtggtgat cggcgagccg 60 

ttcccqccqa tcgagttccc gctcgagcag aagatctcct ccaacaagtc cctctccgag 120 

ccgggcccca ccatcgcgca gcagggcaac aaggtgatcg eggagaagtc cctcgacctc 180 

aaggagcaca ccatcggcct eggegagaag gcccccgagc tggaccgcaa gcgcaagcgc 240 

tacgtgacgt acaacgtgga cgccggcgcc tacaagaagt accaggaccc getctaegtg 300 

rccazcccqc tcttcatctc cgegaaggae ggcgcggcca ccggctactt cttcaactcc 360 

gcccccaagg tgatcttcga cgtgggcctc gaggagtacg acaaggegat cgtgaccatc 420 

ceggaggact ccgtggagtt ccacgtgatc gagggcccgc gcatcgagga cgtgctcgag 480 

aagtacaccg agetgacegg caagccgttc ctcccgccga tgtgggcctt eggctacatg 540 

atctcccgct actcctacta cccgcaggac aaggtggtgg agctggtgga catcatgeag 600 

aaggagggee tccgcgtggc cggcgtgctc ctcgacatcc actacatgga ctcctacaag 660 

ctctecacct ggcacccgta ccgcttcccg gagecgaaga agctcatcga cgagctgcac 720 

aagcgcaacg tgaagctcat caccaccgtg gaccacggca tccgcgtgga ccagaactac 780 

cccccgcccc tccccggcat gggcaagttc tgegagateg agtceggega gctgttcgtg 840 

gqcaagacgt ggccgggcac caccgtgtac ccggacttct tecgegagga cacccgcgag 900 

cggcgggccg gcctcacctc cgagtggctc tcccagggcg tggaeggcat ctggctcgac 960 

acgaacgagc cgaccgaccc cccccgcgcc atcgagatcc gcgacgtgct ctcctccccc 1020 

ccggtgcagt cccgcgacga ccgccccgtg accaccttcc eggacaaegt ggtgcactac 1080 

ctccgcggca agegegegaa gcacgagaag gtgcgcaacg cctacccgct etacgaggeg 1140 

acgqccaccc ccaagggcct ccgcacctcc caccgcaacg agaccttcac cctctcccgc 1200 

gccggctacg ccggcatcca gcgctacgcc ttcatctgga ccggcgacaa caccccgtcc 1260 

cgggacgacc ccaagcccca gctccagctc gtgctcggcc tctccacctc cggcgtgccg 1320 

tccgcgggcc gegacategg cggcttccag ggccgcaact tegecgagat cgacaacccg 1380 

atggaccccc tcgtgaagta ctacgccctc gccctcttct tcccgttcta ccgctcccac 1440 

aaggccaccg aeggcatega caccgagccg gtgttcctcc eggactacta caaggagaag 1500 

gcgaaggaga tcgtggagct gcgctacaag ttcctcccgt acatctactc cctcgccctc 1560 

gaggcctccg agaagggeca cccggtgatc cgcccgctct tctacgagtt ccaggacgac 1620 
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gacgacatgt accgcatcga ggacgagtac atggtgggca agtacctcct ctacgccccg 1680 

atcgtgtcca aggaggagtc ccgccccgcg accctcccgc gcggcaagtg gtacaactac 1740 

tggaacggcg agatcatcaa cggcaagtcc gtggtgaagt ccacccacga gctgccgatc 1800 

tacctccgcg agggctccat caccccgccc gagggcgacg agctgatcgt gtacggcgag 1860 

acccccttca agcgccacga caacgccgag atcacctcct cctccaacga gatcaagttc 1920 

ccccgcgaga tctacgtgtc caagctcacc atcacctccg agaagccggt gtccaagatc 1980 

atcgtggacg accccaagga gatccaggtg gagaagacca tgcagaacac ctacgtggcc 2040 

aaqatcaacc agaagacccg cggcaagatc aacctcgagt ga 2082 

<210> 7 
<211> 1818 
<212> DNA 

<2 1 3> Arcificial Sequence 

<220> 

<22 3> synthetic 
<400> 1 

atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 

gcccccacgc tcccccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 

gccgacocgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggcacca gcaccagcag 180 

gcgcgccgcg gggccaggtc cccgtcgctc gtcgtgtgcg ccagcgccgg catgaacgtc 240 

gccttcgticg gcgccgagac ggcgccgtgg agcaagaccg gaggcctcgg cgacgtcctc 300 

ggcggcctgc cgccggccat ggccgcgaac gggcaccgtg tcatggtcgt ctctccccgc 360 

tacgaccagt acaaggacgc ctgggacacc agcgtcgtgt ccgagatcaa gatgggagac 420 

gggcacgaga cggtcaggtt cttccactgc tacaagcgcg gagtggaccg cgtgttcgtt 480 

qaccacccac tgttcctgga gagggtttgg ggaaagaccg aggagaagat ctacgggcct 540 

gccgctggaa cgqactacag ggacaaccag ctgcggttca gcctgctatg ccaggcagca 600 

cctgaaqctc caaggatcct gagcctcaac aacaacccac acctctccgg accatacggg 660 

caggacgtcg tgttcgcctg caacgactgg cacaccggcc ctctctcgtg ctacctcaag 720 

agcaactacc agtcccacgg catctacagg gacgcaaaga ccgctttctg catccacaac 780 

azctcctacc agggccggtc cgccttcccc gactacccgg agctgaacct ccccgagaga 840 

tccaagtcgt ccttcgattt caticgacggc tacgagaagc ccgtggaagg ccggaagatc 900 

aactggatga aggccgggat cctcgaggcc gacagggtcc tcaccgtcag cccctactac 960 

gccgaggagc tcatccccgg catcgccagg ggctgcgagc tcgacaacat catgcgcctc 1020 

accggcatca ccggcatcgt caacggcatg gacgtcagcg agtgggaccc cagcagggac 1080 

aagtacatcg ccgtgaagta cgacgtgtcg acggccgtgg aggccaaggc gctgaacaag 1140 

gaggcgctgc aggcggaggt cgggctcccg gtggaccgga acatcccgct ggtggcgttc 1200 

atcggcaggc tggaagagca gaagggcccc gacgtcatgg cggccgccat cccgcagctc 1260 

atggagatgg tggaggacgt gcagatcgtt ctgctgggca cgggcaagaa gaagttcgag 1320 

cgcatgctca tgagcgccga ggagaagttc ccaggcaagg tgcgcgccgt ggtcaagttc 1380 

aacgcggcgc tggcgcacca catcatggcc ggcgccgacg tgctcgccgt caccagccgc 1440 

ttcgagcccc gcggcctcat ccagctgcag gggatgcgat acggaacgcc ctgcgcctgc 1500 

gcgcccaccg gtggactcgt cgacaccacc atcgaaggca agaccgggtt ccacatgggc 1560 

cgcctcagcg tcgactgcaa cgtcgtggag ccggcggacg tcaagaaggt ggccaccacc 1620 

ccgcagcgcg ccatcaaggt ggtcggcacg ccggcgtacg aggagatggt gaggaactgc 1680 

acgatccagg atctctcctg gaagggccct gccaagaact gggagaacgt gctgctcagc 1740 

cccggggtcg ccggcggcga gccagcggtt gaaggcgagg agatcgcgcc gctcgccaag 1800 

gagsacgtgg ccqcqccc 1818 

<210> S 
<211> 606 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
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<400> 8 

Met Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly Leu 

i 5 10 15 

GJy Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly Leu 

20 25 30 

Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg Thr 

35 40 45 

Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg Gly 

50 55 60 

AJa Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Met Asn Val 
65 70 75 ' 80 

Val Phe Val Gly Ala Glu Met Ala Pro Trp Ser Lys Thr Gly Gly Leu 

85 90 95 

Gly Asp Val Leu Gly Gly Leu Pro Pro Ala Met Ala Ala Asn Gly His 

100 105 110 

Arg Val. Met Val Val Ser Pro Arg Tyr Asp Gin Tyr Lys Asp Ala Trp 

115 120 ^ 125 

Asp Thr Ser Val Val Ser Glu lie Lys Met Gly Asp Gly Tyr Glu Thr 

130 135 140 

Val Arg Phe Phe His Cys Tyr Lys Arg Gly Val Asp Arg Val Phe Val 
145 150 155 160 

Asp His Pro Leu Phe Leu Glu Arg Val Trp Gly Lys Thr Glu Glu Lys 

165 170 175 

He Tyr Gly Pro Val Ala Gly Thr Asp Tyr Arg Asp Asn Gin Leu Arg 

180 185 190 

Phe Ser Leu Leu Cys Gin Ala Ala Leu Glu Ala Pro Arg lie Leu Ser 

195 200 205 

Leu Asn Asn Asn Pro Tyr Phe Ser Gly Pro Tyr Gly Glu Asp Val Val 

210 215 220 

Phe Val Cys Asn Asp Trp His Thr Gly Pro Leu Ser Cys Tyr Leu Lys 
225 230 235 240 

Ser Asn Tyr Gin Ser His Gly He Tyr Arg Asp Ala Lys Thr Ala Phe 

245 250 255 

Cys lie His Asn lie Ser Tyr Gin Gly Arg Phe Ala Phe Ser Asp Tyr 

260 265 270 

Pro Glu Leu Asn Leu Pro Glu Arg Phe Lys Ser Ser Phe Asp Phe lie 

275 280 285 

Asp Gly Tyr Glu Lys Pro Val Glu Giy Arg Lys lie Asn Trp Met Lys 

290 295 * 300 

Ala Gly lie Leu Glu Ala Aso Arg Val Leu Thr Val Ser Pro Tyr Tyr 
305 310 * 315 320 

Ala Glu Glu Leu lie Ser Gly He Ala Arg Gly Cys Glu Leu Asp Asn 

325 330 335 

He Met Arg Leu Thr Gly lie Thr Gly He Val Asn Gly Met Asp Val 

340 345 350 

Ser Glu Trp Asp Pro Ser Arg Asp Lvs Tvr lie Ala Val Lys Tyr Asp 

355 360 365 

Val Ser Thr Ala Val Glu Ala Lys Ala Leu Asn Lys Glu Ala Leu Gin 

370 375 380 

Aia Glu Val Gly Leu Pro Val Asp Arg Asn He Pro Leu Val Ala Phe 
335 390 395 400 

lie Gly Arg Leu Glu Glu Gin Lys Gly Pro Asp Val Met Ala Ala Ala 

405 410 415 

He Pro Gin Leu Met Glu Met Val Glu Asp Val Gin He Val Leu Leu 

420 425 430 

Gly Thr Gly Lys Lys Lys Phe Glu Arg Met Leu Met Ser Ala Glu Glu 

435 440 445 

Lys Phe Pro Giy Lys Val Arg Ala Val Val Lys Phe Asn Ala Ala Leu 
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450 455 460 



Ala 


His 


His 


He 


Met 


Ala 


Gly 


Ala 


Asp 


Val 


Leu 


Ala 


Val 


Thr 


Ser 


Arg 


465 










470 










475 










480 


?he 


Glu 


Pro 


Cys 


Gly 
485 


Leu 


He 


Gin 


Leu 


Gin 
490 


Gly 


Met 


Arg 


Tyr 


Gly 
495 


Thr 


Pro 


Cys 


Ala 


Cys 
500 


Ala 


Ser 


Thr 


Gly 


Gly 
505 


Leu 


Val 


Asp 


Thr 


He 
510 


He 


Glu 


G] y 


Lys 


Thr 
515 


Gly 


Phe 


His 


Met 


Gly 
520 


Arg 


Leu 


Ser 


Val 


Asp 
525 


Cys 


Asn 


Val 


Val 


Glu 
530 


Pro 


Ala 


Asp 


Val 


Lys 
535 


Lys 


Val 


Ala 


Thr 


Thr 
540 


Leu 


Gin 


Arg 


Ala 


!le 


Lys 


Vci 


Val 


Gly 


Thr 


Pro 


Ala 


Tyr 


Glu 


Glu 


Met 


Val 


Arg 


Asn 


Cys 


54 5 








550 










555 










560 


Met 


lie 


Gin 


Asp 


Leu 
565 


Ser 


Trp 


Lys 


Gly 


Pro 
570 


Ala 


Lys 


Asn 


Trp 


Glu 
575 


Asn 


Val 


Leu 


Leu 


Ser 
580 


Leu 


Gly 


Val 


Ala 


Gly 
585 


Gly 


Glu 


Pro 


Gly 


Val 
590 


Glu 


Gly 


Glu 


Glu 


He 
595 


Ala 


Pro 


Leu 


Ala 


Lys 
600 


Glu 


Asn 


Val 


Ala 


Ala 
605 


Pro 







<2\0> 9 

<2 1 1 > 2223 

<212> DMA 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 9 

atggccaagt acctggagct 
gtcccgagcg gaggcatctg 
gccggcatct ccgcgatctg 
atgggctacg acccgtacga 
gagacgcgct tcgggtccaa 
agcatcaagg ccatcgcgga 
aacccgtccg tcggcgacta 
accgccaact acctcgactt 
ggcggctacc cggacatctg 
caggagtccc acgcggccta 
gtcaagggct acggggcctg 
gcgggcgagt actgggacac 
gccaaggtgc tcgacttccc 
atcccggcgc tcgtcgaggc 
aaggccgtga ccttcgtcgc 
tacgccttca tcctcaccta 
tggctgaaca aggacaagct 
tccactagta tcgtgtacta 
cccaagcccg ccctgatcac 
tacgccccga agctcgccgg 
ctggacaagt acqtgtactc 
gccaacggcc agtacggcta 
ggcatcctcg aggccgacag 
cccggcatcg ccaggggctg 
atcgtcaacg gcatggacgt 
aagcacgacg cgtcgacggc 
gagctcgogc tcccggtgga 
gaccagaagg cccccgacgt 



ggaggagggc ggcgtgatca 
gtgggacacc atccgccaga 
gataccgcca gcttccaagg 
ctacttcgac ctcggcgagt 
gcaggagctc atcaacatga 
catcgtgatc aaccacaggg 
cacctggacg gacttctcca 
ccaccccaac gagctgcacg 
ccacgacaag tcctgggacc 
cctgcgctcc atcggcatcg 
ggtggtcaag gactggctca 
caacgtcgac gcgctgctca 
cctgtactac aagacggacg 
cctgaagaac ggcggcacgg 
caaccacgac acgqacatca 
cgagggccag cccacgatct 
caagaacctg atctggattc 
cgactccgac gagatgatct 
gtacatcaac ctgggctcct 
cgcgtgcatc cacgagtaca 
ctccggctgg gtctacctgg 
ctccgcgcgg ccctactgcg 
ggccctcacc gtcagcccct 
cgagctcgac aacatcatgc 
cagcgagtgg gaccccagca 
cgtggaggcc aaggcgctga 
ccggaacatc ccgctggtgg 
catggcggcc gccatcccgc 



tgcaggcgtt ctactgggac 60 
agatccccga gtggtacgac 120 
gcatgtccgg gggctactcg 180 
actaccagaa gggcacggtg 240 
tcaacacggc gcacgcctac 300 
ccggcggcga cctggagtgg 360 
aggtcgcctc cggcaagtac 420 
cgggcgactc cggcacgttc 480 
agtactggct ctgggcctcg 540 
acgcgtggcg cttcgactac 600 
actggtgggg cggctgggcg 660 
actgggccta ctcctccggc 720 
cggccttcga caacaagaac 780 
tggtctcccg cgacccgttc 840 
tctggaacaa gtacccggcg 900 
tctaccgcga ctacgaggag 960 
acgacaacct cgcgggcggc 1020 
tcgtccgcaa cggctacggc 1080 
ccaaggtggg ccgctgggtg 1140 
ccggcaacct cggcggctgg 1200 
aggccccggc ctacgacccc 1260 
gcgtcggcac atcgattgct 1320 
actacgccga ggagctcatc 1380 
gcctcaccgg catcaccggc 1440 
gggacaagta catcgccgtg 1500 
acaaggaggc gctgcaggcg 1560 
cgttcatcgg caggctggaa 1620 
agctcatgga gatggtggag 1680 
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gacgtgcaga tcgttctgct gggcacgggc aagaagaagt tcgagcgcat gctcatgagc 1740 

gccgaggaga agttcccagg caaggtgcgc gccgtggtca agttcaacgc ggcgctggcg 1800 

caccacacca tggccggcgc cgacgcgctc gccgtcacca gccgcttcga gccctgcggc 1860 

ctcatccagc tgcaggggat gcgatacgga acgccctgcg cctgcgcgtc caccggcgga 1920 

cccgtcgaca ccatcatcga aggcaagacc gggttccaca tgggccgcct cagcgtcgac 1980 

tgcaacatcg cggagccggc ggacgccaag aaggtggcca ccaccttgca gcgcgccatc 2040 

aaggtggtcg gcacgccggc gtacgaggag atggtgagga accgcatgat ccaggatctc 2100 

tcctggaagg gccctgccaa gaactgggag aacgtgctgc tcagcctcgg ggtcgccggc 2160 

ggcgagccag gggttgaagg cgaggagacc gcgccgctcg ccaaggagaa cgtggccgcg 2220 



<210> 10 

<21!> 741 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<*3 00> 10 



Met 


Ala 


Lvs 


Tyr 


Leu 


Glu 


Leu 


Glu Glu Gly Gly 


Val 


lie 


Met 


Gin 


Ala 


1 




5 










10 










15 




Phe 


Tyr 


Trp 


Asp 


Vai 


Pro 


Ser 


Gly 


Gly 


lie 


Trp 


Trp 


Asp 


Thr 


lie 


Arg 




20 










25 










30 






G: n 


I \; c 


He 


Pro 


Glu 


Trp 


Tyr 


Asp 
40 


Ala 


Gly 


Tie 


Ser 


Ala 
45 


lie 


Trp 


lie 


Pro 


Pro 


Ala 


Ser 


Lys 


Glv 


Met 


Ser 


Gly 


Gly 


Tyr 


Ser 


Met 


Gly 


Tyr 


Asp 




50 








55 










60 










Pro 


Tyr 


Asp 


Tyr 


Phe 


ASD 

i 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


65 






70 










75 










80 


Glu 


Thr 


Arg 


Phe 


Gl y 


Ser 


Lys 


Gin 


Glu 


Leu 


lie 


Asn 


Met 


lie 


Asn 


Thr 








S5 










90 










95 




Ala 


His 


Ala 


Tyr 
100 


Gly 


lie 


Lys 


Vai 


lie 
105 


Ala 


Asp 


lie 


Val 


He 
110 


Asn 


His 


Arc 


Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 


Tyr 


Thr 






115 






120 










125 








Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Vai 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


Asn 


Tyr 


1 30 






135 










140 










Leu 


Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp Ser Gly 


Thr 


Phe 


1-15 








150 










155 










160 


Gly 


Gly 


Tyr 


Pro 


Asp 


lie 


Cys 


His 


Asp 


Lys 


Ser 


Trp Asp Gin 


Tyr 


Trp 




165 










170 










175 




Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


lie 


Gly 






180 










185 










190 






lie 


Asp 


Ala 


Trp 


Arg 


Phe 


Asp 


Tyr 


Vai 


Lys 


Gly 


Tyr 


Gly Ala 


Trp Val 




195 










200 










205 








Vai 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp Gly 


Gly 


Trp 


Ala 


Val 


Gly 


Glu 


Tyr 




210 








215 










220 










Trp 


Asp 


Thr 


Asn 


Vai 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


225 








230 










235 










240 


Ala 


Lys 


Vai 


Phe 


AoD 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 








245 










250 










255 




■ 


Asn 


Lys 


Asn 
260 


lie 


Pro 


Ala 


Leu 


Vai 
265 


Glu 


Ala 


Leu 


Lys 


Asn 
270 


Gly 


Gly 


Thr 


Vai 


Vai 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 


Ala 


Vai 


Thr 


Phe 


Val 


Ala 


Asn 






275 








280 










285 








H is 


Asp 
290 


Thr 


Asp 


lie 


I le 


Trp 
295 


Asn 


Lys 


Tyr 


Pro 


Ala 
300 


Tyr 


Ala 


Phe 


lie 
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Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


305 










310 






Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 










325 








Leu 


Ala 


Giy 


Gly 


Ser 


Thr 


Ser 


lie 






340 










lie 


Fhe 


Val 


Arg 


Asn 


Giy 


Tyr 


Gly 






355 










360 


He 


Asn 


Leu 


Giy 


Se r 


Ser 


Lys 


Val 




3"?0 










37 5 




?he 


Aia 


Gly 


A.) a 


Cys 


lie 


His 


Glu 


385 










390 






Val 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 










405 








Ala 


Tyr 


Asp 


Pro 


Aia 


Asn 


Gly 


Gin 








420 










Cvs 

* 


Giy 


Val 


Gly 


Thr 


Ser 


lie 


Ala 










440 


Leu 


Thr 


vai 


Ser 


Pro 


Tyr 


Tyr 


Ala 




4 50 










455 




Ar q 


Giy 


Cys 


Glu 


Leu 


Asp 


Asn 


lie 


4 65 










470 






Tie 


Val 


Asn 


Gly 


Met 


Asp 


Val 


Ser 










485 








T v r 


lie 


Ala 


Vai 


Lys 


Tyr 


Asp 


Val 






500 










Leu 


Asn 


Lys 


Glu 


Ala 


Leu 


Gin 


Ala 






515 










520 


Asn 


lie 


Pro 


Leu 


Val 


Ala 


Phe 


lie 




530 










535 




Pro 


Asp 


Val 


Met 


Aia 


Ala 


Ala 


lie 


54 5 








550 






Asp 


Val 


Gin 


lie 


Val 


Leu 


Leu 


Gly 








565 








Met 


Leu 


Met 


Ser 


Ala 


Glu 


Giu 


Lys 








580 










Val 


Lys 


Phe 


Asn 


Ala 


Ala 


Leu 


Ala 






595 










600 


val 


Leu 


Ala 


Val 


Thr 


Ser 


Arg 


Phe 




610 










615 




Gin 


Gly 


Met 


Arg 


Tyr 


Gly 


Thr 


Pro 


625 










630 






Leu 


Val 


Asp 


Thr 


He 


lie 


Glu 


Gly 










645 








Leu 


Ser 


Val 


Asp 


Cys 


Asn 


Val 


Val 








660 










Ala 


Thr 


Thr 


Leu 


Gin 


Arg 


Ala 


lie 






675 










680 


Glu 


Glu 


Met 


val 


Ara 


Asn 


Cys 


Met 




690 










695 




Fro 


Aia 


Lvs 

• 


Asn 


Trp 


Glu 


Asn 


Val 


705 








710 






Gly 


Giu 


Pro 


Gly 


Val 


Glu 


Gly 


Glu 








725 








Asn 


Val 


Ala 


Ala 


Pro 









740 



lie 


Phe 


Tyr 


Arg 


Asp 


Tyr- 


Glu 


Glu 






315 










320 


Asn 


Leu 


lie 


Trp 


lie 


His 


Asp 


Asn 




330 








335 




Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 


Glu 


Met 


345 










350 






Ser 


Lys 


Pro 


Gly 


Leu 


He 


Thr 


Tyr 










365 








Gly 


Arg 


Trp 


Val 


Tyr 


Val 


Pro 


Lys 








380 










Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 


Gly 


Trp 






395 










400 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 




410 










415 




Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


425 










430 






Gly 


lie 


Leu 


Glu 


Ala 


Asp Arg 


Val 










445 








Glu 


Glu 


Leu 


lie 


Ser Gly 


He 


Ala 








4 60 










Met 


Arg 


Leu 


Thr 


Gly 


He 


Thr 


Gly 






475 










480 


Glu 


Trp 


Asp 


Pro 


Ser 


Arg 


Asp 


Lys 




490 










495 




Ser 


Thr 


Aia 


Val 


Glu 


Ala 


Lys 


Ala 


505 










510 






Glu 


Val 


Gly 


Leu 


Pro 


Val 


Asp 


Arg 










525 








Gly 


Arg 


Leu 


Glu 


Glu 


Gin 


Lys 


Gly 








540 










Pro 


Gin 


Leu 


Met 


Glu 


Met 


Val 


Glu 






555 










560 


Thr 


Gly 


Lys 


Lys 


Lys 


Phe 


Glu 


Arg 




570 










575 




Phe 


Pro 


Gly 


Lys 


Val 


Arg 


Ala 


Val 


5S5 










590 






His 


His 


lie 


Met 


Ala 


Gly Ala 


Asp 










605 








Glu 


Pro 


Cys 


Gly 


Leu 


He 


Gin 


Leu 








620 










Cys 


Ala 


Cys 


Ala 


Ser 


Thr 


Gly 


Gly 




635 










640 


Lys 


Thr 


Gly 


Phe 


His 


Met 


Gly 


Arg 




650 










655 




Glu 


Pro 


Ala 


Asp 


Val 


Lys 


Lys 


Val 


665 










670 






Lys 


val 


Val 


Gly 


Thr 


Pro 


Ala 


Tyr 










685 








lie 


Gin 


Asp 


Leu 


Ser 


Trp 


Lys 


Gly 








700 










Leu 


Leu 


Ser 


Leu 


Giy 


Val 


Ala 


Gly 






715 










720 


Glu 


lie 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu 




730 










735 
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<210> 11 
<211> 15)5 
<2}2> DNA 
<21 3> Zea mays 

<400> 1: 

'jgagagccat gagacgtacg cccccaaagc cactttgcat tgtgtgaaac caatatcgat 60 

cittgttact ccaccatgca tgaacatttg tggaaactac tagcttacaa gcattagtga 120 

cagctcagaa aaaacctatc tatgaaaggt ttcatgtgta ccgtgggaaa tgagaaatgt 180 

tgccaactca aacaccttca atatgttgtt tgcaggcaaa ctcctctgga agaaaggtgt 240' 

ctaaaactat gaacgggtta cagaaaggta taaaccacgg ctgtgcattt tggaagtatc 300 

atctatagat gtctgttgag gggaaagccg tacgccaacg ttatttactc agaaacagct 360 

ccaacacaca gttgtctgct ttatgatggc atctccaccc aggcacccac catcacctat 420 

ctctcgtgcc tgtttatttt cttgcccttt ctgatcataa aaaaacatta agagtttgca 480 

aacatgcata ggcatatcaa tatgctcatt tattaatttg ctagcagatc atcttcctac 540 

tctttacctc atttattgtt tgaaaaatat gtcctgcacc tagggagctc gtatacagta 600 

ccaatgcatc ctcattaaat gtgaatttca gaaaggaagt aggaacctat gagagtattt 660 

ttcaaaatta actagcggct tctattatgt ttatagcaaa ggccaagggc aaaattggaa 720 

-accaatgat ggttggttgc atgagtctgt cgattacttg caagaaatgt gaacctttgt 780 

itctgtgcgt gggcataaaa caaacagctt ctagcctctt ttacggtact tgcacttgca 840 

agaaatgtca actccctttc atttctgtat gtggacataa tgccaaagca tccaggcttt 900 

ttcatggttg cigatgtctc tacacagttc atctccacca gtatgccctc ctcatactct 960 

acataaacac atcaacagca tcgcaattag ccacaagatc acttcgggag gcaagtgcga 1020 

Utcgatctc gcagccaccc ttttttgttc tgttgtaagt ataccttccc ttaccatctt 1080 

tatccgttag titaacttgt aattgggaag cattagtgga aagaggatga gatgctatca 1140 

cctacgtact ctgcaaatgc atctgacgtt atatgggctg cttcatataa tttgaattgc 1200 

cccattcttg ccgacaatat attgcaaggt atatgcctag ttccatcaaa agttctgttt 1260 

Utcattcca aaagcatttt agcggcacac aatttttgtc catgagggaa aggaaatctg 1320 

ttttggttac tttgcttgag .gtgcattct t catatgtcca gttttatgga agtaataaac 1380 

ttcagtttgg tcataagatg tcatatcaaa gggcaaacat atattcaatg ttcaattcat 1440 

cgcaaatgtc ccctttttgt aaaagattgc atactcattt atttgagttg caggtgtatc 1500 

cagtagttgg aggag 1515 

<210> 12 
<211> 6*73 
<212> DNA 
<213> Zea mays 

<400> 12 

gatcatccag gtgcaaccgt ataagtccta aagtggtgag gaacacgaaa caaccatgca 

ttggcatgta aagccccaag aatttgtcgt atccttaaca actcacagaa catcaaccaa 

aattgcacgt caagggtatt gggtaagaaa caatcaaaca aatcctctct gtgtgcaaag 

aaacacggtg agicatgccg agatcatact catctgatat acatgcttac agctcacaag 

acaziacaaa caactcatat cgcattacaa agatcgtttc atgaaaaata aaataggccg 300 

gacaggacaa aaacccctga cgcgtaaagt aaatttacaa caaaaaaaaa gccatatgtc 360 

aagctaaatc taattcgttt tacgtagatc aacaacctgt agaaggcaac aaaactgagc 420 

cacgcagaag cacagaatga ttccagatga accatcgacg tgctacgtaa agagagtgac 480 

gagtcatata catttggcaa gaaaccatga agccgcctac agccgtctcg gtggcataag 540 

aacacaaqaa attgtgttaa ccaaccaaag ctataaataa cgctcgcatg cctgtgcact 600 

tctccaccac caccactggg ccttcagacc attagcttta tctactccag agcgcagaag 660 

aacccgaccg aca 

<2i0> 13 
<211> 4 54 
<212> PRT 

< 2 1 3> Artificial Sequence 
<220> 



60 
120 
180 
240 
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< 22 3 > synthetic 



<<;oo> n 



Met 


A r cj 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


3 
















10 










15 




Ala 


Thr 


Ser 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 


Val 


lie 


Met 








20 








25 










30 






Gin 


Ala 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


lie 


Trp 


Trp 


Asp 


Thr 






35 










40 










45 








Lie 


Ara 


Gin 


Lys 


lie 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


lie 


Ser 


Ala 


lie 




50 










55 










60 










T r d 


Tie 


Pre 


Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 


Gly 


Tyr 


Ser 


Met 


Gly 


65 










70 










75 










80 


Tut 

d 


Asp 


Pre 


Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 










85 










90 










95 




Thr 


Val 


Glu 


Thr 


Arg 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 


Leu 


lie 


Asn 


Met 


lie 








100 








105 










110 






Asn 


Thr 


Ala 


His 


Ala 


Tyr 


Gly 


lie 


Lys 


Val 


lie 


Ala 


Asp 


lie 


Val 


lie 






115 










120 










125 








Asn 


His 


Arg 


Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


val 


Gly 


Asp 




130 










135 










140 










Tvr 


Thr 


Trp 


Thr 


Asp 


Phe 


Set- 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


145 










150 










155 










160 


Asn 


Tyr 


Leu 


Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 










165 










170 










175 




Thr 


Phe 


Gly 


Gly 


Tyr 


Pro 


Asp 


lie 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 








180 










185 










190 






Tyr 


Trp 


Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


195 








200 










205 








] le 


G 1 y 


lie 


Asp 


Ala 


Trp 


Arg 


Phe 


Asp 


Tyr 


Val 


Lys 


Gly 


Tyr 


Gly 


Ala 




210 










215 










220 










Trp 


Val 


Val 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp 


Gly 


Gly 


Trp 


Ala 


Val 


Gly 


225 










230 










235 










240 


Glu 


Tvr 


Trp 


Asp 


Thr 


Asn 


Vai 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 








205 










250 










255 




Ser 


G) y 


Ala 


LVS 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 








2 60 










265 










270 








Phe 


Asp 


Asn 


Lys 


Asn 


lie 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 






275 










280 










285 








Gly 


Gly 


Thr 


Val 


Vai 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


290 










295 










300 










Ala 


A s n 


His 


Asp 


Thr 


Asp 


lie 


lie 


Trp 


Asn 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 


305 








310 










315 










320 


Phe 


He 


Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


lie 


Phe 


Tyr 


Arg 


Asp 


Tyr 










325 










330 










335 




Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Aso 


Lys 


Leu 


Lys 


Asn 


Leu 


lie 


Trp 


lie 


His 






340 








345 










350 






Asp 


Asn 


Leu 


Ala 


Gly 


Gly 


Ser 


Thr 


Ser 


lie 


Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 




355 










360 










365 








Glu 


Met 


lie 


Phe 


Val 


Arg 


Asn 


Gly 


Tyr 


Gly 


Ser 


Lys 


Pro 


Gly 


Leu 


lie 




370 










375 










380 










Thr 


Tvr 

m 


lie 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 


Arg 


Trp 


Val 


Tyr 


Val 


335 








390 










395 










400 


Fro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


lie 


His 


Glu 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 










405 










410 










415 




Gly 


Trp 


Val 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 








420 










425 










430 






Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 
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4 35 4 40 

Ser Tyr Cys Giy Val Giy 
4 50 



<210> 14 

<211> 460 

<212> PRT 

<213> Artificial Sequence 
<220> 

<" 2 2 3> synthetic 



<400> 14 



Met 


A r q 


Val 


Leu 


Leu 


val 


Ala 


Leu 


Ala 


Leu 


i 








5 










10 


Ai a 


Thr 


Ser 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 








20 








25 




Gin 


A: a 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Giy 






35 










40 






He 


Arg 


Gin 


Lys 


F ie 


Pro 


Glu 


Trp 


Tyr 


Asp 




50 










55 








Tro 


lie 


Pro 


Pro 


Ala 


Ser 


Lys 


Giy 


Met 


Ser 


65' 










70 








Tyr 


Asp 


Pro 


Tyr 


Aso 


Tyr 


Phe 


Asp 


Leu 


Giy 










85 










90 


Thr 


Val 


Glu 


Thr 


Arg 


Phe 


Giy 


Ser 


Lys 


Gin 








100 










105 




Asn 


Thr 


Ala 


His 


Ala 


Tyr 


Giy 


lie 


Lys 


Val 






115 










120 








His 


Arg 


Ala 


Giy 


Giy 


Asp 


Leu 


Glu 


Trp 




130 










135 








i yr 


Thr 


Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


145 










150 










Asn 


Tyr 


Leu 


Aso 

4 


Phe 


His 


Pro 


Asn 


Glu. 


Leu 








165 










170 


Thr 


Phe 


Giy 


Giy 


Tyr 


Pro 


Asp 


lie 


Cys 


His 








180 










185 




Tyr 


Trp 


Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 






195 










200 






lie 


Giy 


Tie 


Asp 


Ala 


Trp 


Arg 


Phe 


Asp Tyr 




210 










215 








Trp 


Val 


Val 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp Trp 


22b 










230 










Glu 


• j - 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 










245 










250 


Ser 


Glv 


Ala 


Lys 


Val 


Phe 


As D 

* 


Phe 


Pro 


Leu 








2 60 










265 




Ala 


Phe 


Asp 


Asn 


Lys 


Asn 


lie 


Pro 


Ala 


Leu 






27 5 










280 






Glv 


Giy 


Thr 


Val 


Val 


Ser 


Arg 


Asp 


Pro 


Phe 




290 










295 








Ala 


Asn 


His 


Asp 


Thr 


Asp 


lie 


lie 


Trp 


Asn 


30 5 










310 










Phe 


T le 


Leu 


Thr 


Tyr 


Glu 


Giy 


Gin 


Pro 


Thr 










325 










330 


Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 








340 










345 
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445 



Leu 


Ala 


Leu 


Ala 


Ala 
15 


Ser 


Glu 


Giy 


Giy 


Val 
30 


lie 


Met 


Giy 


lie 


Trp 
45 


Trp 


Asp 


Thr 


Ala 


Giy 
60 


lie 


Ser 


Ala 


lie 


Giy 


Giy 


Tyr 


Ser 


Met 


Giy 


75 










80 


Glu 


Tyr 


Tyr 


Gin 


Lys 
95 


Giy 


Glu 


Leu 


lie 


Asn 
110 


Met 


lie 


lie 


Ala 


Asp 
125 


lie 


Val 


lie 


Asn 


Pro 
140 


Phe 


val 


Giy 


Asp 


Ser 


Giy 


Lys 


Tyr 


Thr 


Ala 


155 










160 


His 


Ala 


Giy Asp 


Ser 


Giy 










175 




Asp 


Lys 


Ser 


Trp 
190 


Asp 


Gin 


Ala 


Ala 


Tyr 
205 


Leu 


Arg 


Ser 


Val 


Lys 


Giy Tyr Giy 


Ala 




220 










Giy 


Giy 


Trp Ala 


Val 


Giy 


235 










240 


Leu 


Asn 


Trp Ala 


Tyr 


Ser 










255 




Tyr 


Tyr 


Lys 


Met 
270 


Asp 


Ala 


Val 


Glu 


Ala 
235 


Leu 


Lys 


Asn 


Lys 


Ala 
300 


Val 


Thr 


Phe 


Val 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 


315 










320 


lie 


Phe 


Tyr 


Arg 


Asp 
335 


Tyr 


Asn 


Leu 


lie 


Trp 


lie 


His 



350 
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Asp 


Asn 


Leu 


Ala 


Glv 


Gly 


Ser 


Thr 


Ser 


lie 






355 








360 






Glu 


Met 


He 


Phe 


Val 


Arg 


Asn 


Gly 


Tyr 


Gly 




370 










375 








Thr 


Tvr 

* 


lie 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


335 








390 










Pro 


Lvs 


Phe 


Ala 


Gly 


Ala 


Cys 


lie 


His 


Glu 










405 










410 


Gly 


Trp 


Val 


Asp 


Lys 


Tyr 

* 


Val 


Tyr 


Ser 


Ser 








420 










425 




Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 






435 








4 4 0 






Ser 


Tyr 


Cys 


Giy 


Val 


Gly 


Ser 


Glu 


Lys 


Asp 




0 50 










455 








<21< 


0> 15 
















<2l 


1> 5 


13 
















<212> PRT 
















<213> A 


rtificial Sequence 








<220> 


















<22 


3> 5' 


ynt het ic 














<400> 15 
















Met 


Leu 


Ala 


Mid 


Leu 


Ala 


Thr 


Ser 


Gin 


Leu 










5 










10 


Leu 


Gly 


Val 


Pro 


Aso 

» 


Ala 


Ser 


Thr 


Phe 


Arg 






20 








25 




Leu 


Arg 


G 1 y 


Ala 


Arq 


Ala 


Ser 


Ala 


Ala 


Ala 




35 










40 






Thr 


Ser 


Ala 


Arg 


Ala 


Ala 


Pro 


Arg 


His 


Gin 




50 










55 








Gly 


Ala 


Arg 


Phe 


Pro 


Ser 


Leu 


Val 


Val 


Cys 


65 








70 










Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 






35 










90 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


lie 


Trp 




100 










105 




Lys 


lie 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


lie 




115 










120 






Pro 


r. 1 a 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 


Giy 


Tyr 




130 










135 








Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


145 










150 










Thr 


Arg 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 


Leu 


lie 








i 65 










170 


His 


Ala 


Tvr 

•4 


Glv 


lie 


Lys 


Val 


lie 


Ala 


Asp 






180 










185 




Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 




195 










200 






Thr 


Aso 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 




210 










215 








Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


225 










230 










Gly 


T«i r 
• j - 


Pro 


Asp 


I ie 


Cys 


His 


Asp 


Lys 


Ser 








245 










250 


Tro 

* 


A 1 a 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 
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Val 


Tyr 


Tyr 


Asp 


Ser 


Asp 






365 


I 






Ser 


Lys 


Pro 


Gly 


Leu 


He 




380 










Gly 


Arg 


Trp 


Val 


Tyr 


Val 


395 










400 


Tyr 


Thr 


Gly 


Asn 


Leu 


Gly 










415 




Gly 


Trp 


Val 


Tyr 


Leu 


Glu 








4 30 






Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 






445 








Glu 


Leu 












460 










Val 


Ala 


Thr 


Arq 


Ala 


Gly 










15 




Arg 


Gly 


Ala 


Ala 


Gin 


Gly 








30 






Asp 


Thr 


Leu 


Ser 


Met 


Arg 






45 








His 


Gin 


Gin 


Ala 


Arg 


Arg 




60 










Ala 


Ser 


Ala 


Gly 


Ala 


Met 


75 










80 


Val 


lie 


Met 


Gin 


Ala 


Phe 










95 




Trp 


Asp 


Thr 


lie 


Arg 


Gin 








110 






Ser 


Ala 


He 


Trp 


He 


Pro 






125 








Ser 


Met 


Gly 


Tyr 


Asp 


Pro 




140 










Gin 


Lys 


Gly 


Thr 


Val 


Glu 


155 










160 


Asn 


Met 


lie 


Asn 


Thr 


Ala 










175 




lie 


val 


lie 


Asn 


His 


Arg 








190 






Val 


Gly 


Asp 


Tyr 


Thr 


Trp 






205 








Tyr 


Thr 


Ala 


Asn 


Tyr 


Leu 




220 










Asp 


Ser 


Gly 


Thr 


Phe 


Gly 


235 










240 


Trp 


Asp 


Gin 


Tyr 


Trp 


Leu 










255 




Leu 


Arg 


Ser 


lie 


Gly 


He 
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260 

Aso Ala Trp Arg 
275 

Lys Asp Trp Leu 
290 



Aso Thr 


Asn 


Val 


3C5 






Lys Val 


Phe 


Asp 


Asn Lys 


Asn 


lie 






340 


Val Val 


Ser 


Arg 




->d5 




Aso Thr 

* 


Asp 


lie 


7 1 D 
J / U 






Thr Tyr 


Glu 


Gi v 

* 


365 






Leu Asn 


Lys 


Asp 


Ala Gly 


Gly 


Ser 






420 


Phe Val 


Arq 


Asn 




435 




Asn Leu 


Gly 


Ser 








Ala Gly 


Ala 


Cys 








Asp L y s 


Tyr 


Val 


Tyr Asp 


Pro 


Ala 






500 


Gly Val 


Gly 


Thr 




515 




<210> 16 




<211> 820 




2 1 2 > PRT 




<21 3> Art i f i.cial 


<220> 






< 2 2 3 > synthetic 


<4 00> 16 




!-;et Leu 


Ala 


Ala 


1 






Leu Gly 


Val 


Pro 






20 


Leu Arg 


Gly 


Ala 




35 




Thr Ser 


Ala 


Arg 


50 






Gly Ala 


Arg 


Phe 


65 






Ala Lys 


Tyr 


Leu 


Tyr Trp 


Asp 


Val 






100 



Phe 


Asp 


Tyr 


Val 








280 


Asn 


Trp 


Trp Gly 






295 




Asp 


Ala 


Leu 


Leu 




310 






Phe 


Pro 


(on 


T \j r 

i yr 


325 








Pro 


Ala 


Leu 


Val 


Asp 


Pro 


Phe 


Lys 








3 60 


lie 


Trp 


Asn 


Lys 






375 




Gin 


Pro 


Thr 


I le 




390 






Lvs 


Leu 


Lvs 


Asn 


405 








Thr 


Ser 


lie 


Val 


Gl v 

« 


Tyr 


Gly 


Ser 






440 


Ser 


Lys 


Val 


Gly 






455 




lie 


His 


Glu 


Tyr 




470 






Tyr 


Ser 


Ser 


Gly 


485 








Asn 


Gly 


Gin 


Tyr 


Ser 


lie 







Sequence 



Leu 


Ala 


Thr 


Ser 


5 








Asp 


Ala 


Ser 


Thr 


A r g 


Ala 


Ser 


Ala 






40 


Ala 


Ala 


Pro 


Arg 






55 




Pro 


Ser 


Leu 


Val 




70 






Glu 


Leu 


Glu 


Glu 


85 








Pro 


Ser 


Gly 


Gly 



265 



Lys 


Gly 


Tyr 


Gly 


Gly 


Trp 


Ala 


Val 








300 


Asn 


Trp 


Ala 


Tyr 






315 




Tyr 


Lys 


Met 


Asp 




330 






Glu 


Ala 


Leu 


Lys 


345 








Ala 


Val 


Thr 


Phe 


Tyr 


Pro 


Ala 


Tyr 








380 


Phe 


Tyr 


Arg 


Asp 






395 




Leu 


lie 


Trp 


lie 




410 






Tyr 


Tyr 


Asp 


Ser 


425 








Lys 


Pro 


Gly 


Leu 


Arg 


Trp 


Val 


Tyr 








460 


Thr 


Gly 


Asn 


Leu 






475 




Trp 


Val 


Tyr 


Leu 




490 






Gly 


Tyr 


Ser 


Val 



505 



Gin 


Leu 


Val 


Ala 




10 






Phe 


Arg 


Arg 


Gly 


25 








Ala 


Ala 


Asp 


Thr 


His 


Gin 


His 


Gin 








60 


Val 


Cys 


Ala 


Ser 




75 




Gly 


Gly 


Val 


lie 




90 






lie 


Trp 


Trp 


Asp 



105 





270 






Ala 


Trp 


Val 


Val 


285 








Gly 


Glu 


Tyr 


Trp 




Car 


Gl v 

vi y 










320 


Ala 


Ala 


Phe 


Asp 






335 




Asn 


Gly 


Gly 


Thr 




350 






Val 


Ala 


Asn 


His 


365 








Ala 


Phe 


lie 


Leu 


T v r 


Glu 


Glu 


Tr o 








400 


His 


Asp 


Asn 


Leu 






415 




Asp 


Glu 


Met 


lie 




4 30 






lie 


Thr 


Tyr 


lie 


445 








Val 


Pro 


Lys 


Phe 


Gly 


Gly 


Trp 


Val 








480 


Glu 


Ala 


Pro 


Ala 






495 




Trp 


Ser 


Tyr 


Cys 




510 







Thr 


Arg 


Ala 


Gly 






15 




Ala 


Ala 


Gin 


Gly 




30 






Leu 


Ser 


Met 


Arg 


45 








Gin 


Ala 


Arg 


Arg 


Ala 


Gly 


Ala 


Met 








80 


Met 


Gin 


Ala 


Phe 






95 




Thr 


lie 


Arg 


Gin 




110 
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Lvs 


11* 


Pro 


Gl u 


Tro 


Tvr 
•» 


Asp 


Ala 






115 










120 


Pro 


Ala 


Ser 


Lys 


Gly 


Met 


Ser 


Gly 




130 










1 35 




Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


14 5 










150 






Thr 


Arc 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 










165 








His 


Ala 


Tvr 

« 


Gly 

180 


lie 


Lys 


Val 


lie 


Ala 


G 1 " 
j 


Gly 


Asp 


Leu 


Giu 


Trp 


Asn 






195 










200 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 




210 










215 




Aso 

t 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


225 










230 






*■» 

u Ly 


Tyr 


Pro 


Asp 


lie 


Cvs 


His 


Asp 










245 








Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 








260 










Asp 


Ala 


Trp 


Arg 


Phe 


Asp 


Tyr 


Val 






275 










280 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp 


Giy 




290 










295 




Aso 

k 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


j Kj 5 










310 






!.A'S 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 










325 








Asn 


Lys 


Asn 


Tie 


Pro 


Ala 


Leu 


Val 






340 










Vol 


Val 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 






355 










360 


Asp 


Thr 


Asp 


lie 


[ le 


Trp 


Asn 


Lys 




370 










375 




Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


385 








390 






Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 










405 








Ala 


Gly 


Gly 


Ser 


Thr 


Ser 


I le 


Val 








420 










Phe 


Va .1 


Arg 


Asn 


Glv 

* 


Tyr 


Gly 


Ser 






435 










4 4 0 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 




450 










455 




Aia 


G 1 y 


Ala 


Cys 


lie 


His 


Glu 


Tyr 


4 65 










470 






Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 










485 








Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 






500 










G 1 y 


Val 


Gly 


Thr 


Ser 


lie 


Aia 


Giy 




515 










520 


Thr 


val 


qo r 

— - — 


Pro 


Tyr 


T ■-• r 
* j *■ 


Aia 


Glu 




530 










535 




Gly 


Cys 


Glu 


Leu 


Asp 


Asn 


lie 


Met 


545 










550 






val 


Asn 


Gly 


Met 


Aso 


Val 


Ser 


Glu 



565 



Gly 


lie 


Ser 


Ala 


lie 


Trp, He 


Pro 










125 








Gly 


Tyr 


Ser 


Met 


Gly 


Tyr 


Asp 


Pro 








140 










Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


Glu 






155 










160 


Leu 


lie 


Asn 


Met 


He 


Asn 


Thr 


Ala 




170 










175 




Ala 


Asp 


He 


val 


He 


Asn 


His 


Arg 


185 










1 90 






Pro 


Phe 


Val 


Gly 


Asp 


Tyr 


Thr 


Trp 










205 








Gly 


Lys 


Tyr 


Thr 


Ala 


Asn 


Tyr 


Leu 








220 










Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


Gly 






235 










240 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 


Leu 


250 










255 




Ala 


Tyr 


Leu 


Arg 


Ser 


He 


Gly 


lie 


265 










270 






Lys 


Gly 


Tyr 


Gly 


Ala 


Trp 


Val 


Val 










285 








Gly 


Trp 


Ala 


Val 


Gly 


Glu 


Tyr 


Trp 








300 










Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


Ala 






315 










320 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 


Asp 




330 










335 




Glu 


Ala 


Leu 


Lys 


Asn 


Gly 


Gly 


Thr 


345 










3 50 






Ala 


Val 


Thr 


Phe 


Val 


Ala 


Asn 


HIS 










365 








Tyr 


Pro 


TV 1 

Ala 


Tyr 


Ala 


Phe 


He 


Leu 






380 










Phe 


Tyr 


Arg 


Asp 


Tyr 


Glu 


Glu 


Trp 




395 










400 


Leu 


lie 


Trp 


lie 


His 


Asp 


Asn 


Leu 




4 10 










415 




Tyr 


Tyr 


Asp 


Ser 


Asp 


Glu 


Met 


He 


425 










4 30 






Lys 


Pro 


Gly 


Leu 


He 


Thr 


Tyr 


He 










445 








Arg 


Trp 


Val 


Tyr 


Val 


Pro 


Lys 


Phe 








4 60 










Thr 


Gly 


Asn 


Leu Gly 


Gly 


Trp 


Val 




475 










480 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 


Ala 




490 










495 




Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


Cys 


505 










510 






lie 


Leu 


Glu 


Ala 


Asp 


Arg 


Val 


Leu 










525 








Glu 


Leu 


lie 


Ser 


Gly 


lie 


Ala 


Arg 








540 










Arg 


Leu 


Thr 


Gly 


He 


Thr 


Gly 


He 






555 










560 


Trp 


Asp 


Pro 


Ser 


Arg 


Asp 


Lys 


Tyr 




570 










575 
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lie 


Ala 


Val 


Lys 


Tyr 


Asp 


Val 


Ser 


Thr 


Ala 


Val 


Giu 


Ala 


Lys 


Ala 


Leu 








580 








585 










590 






Asn 


Lys 


Giu 
595 


Ala 


Leu 


Gin 


Ala 


Giu 
600 


Val 


Gly 


Leu 


Pro 


Val 
605 


Asp 


Arg 


Asn 


lie 


Pro 
610 


Leu 


Val 


Ala 


Phe 


lie 
615 


Gly 


Arg 


Leu 


Giu 


Giu 
620 


Gin 


Lys 


Gly 


Pro 


Asp 


Val 


Met 


Ala 


Ala 


Ala 


lie 


Pro 


Gin 


Leu 


Met 


Giu 


Met 


Val 


Giu 


Asp 


625 










630 










635 










640 


val 


Gin 


He 


Val 


Leu 
645 


Leu 


Glv 

■4 


Thr 


Gly 


Lys 
650 


Lys 


Lys 


Phe 


Giu 


Arg 
655 


Met 


Leu 


Met 


Ser 


Ala 

660 


Giu 


Giu 


Lys 


Phe 


Pro 

665 


Gly 


Lys 


Val 


Arg 


Ala 

670 


Val 


Val 


Lvs 

m 


Phe 


Asn 


Ala 


Ala 


Leu 


Ala 


His 


His 


lie 


Met 


Ala 


Gly 


Ala 


Asp 


Val 






675 










680 










685 






Leu 


Ala 
690 


Val 


Thr 


Ser 


Arg 


Phe 
695 


Giu 


Pro 


Cys 


Gly 


Leu 
700 


lie 


Gin 


Leu 


Gin 


G 1 y 


Met 


Ara 


Tyr 


Gly 


Thr 


Pro 


Cys 


Ala 


Cys 


Ala 


Ser 


Thr 


Gly 


Gly 


Leu 


705 










710 










715 










720 


Val 


Asp 


Thr 


lie 


lie 


Giu 


Gly 


Lys 


Thr 


Gly 


Phe 


His 


Met 


Gly Arg 


Leu 










725 










730 










735 




Ser 


Val 


Asp 


Cvs 

« 

740 


Asn 


Val 


Val 


Giu 


Pro 
745 


Ala 


Asp 


val 


Lys 


Lys 
750 


Val 


Ala 


Thr 


Thr 


Leu 
755 


Gin 


Arg 


Ala 


lie 


Lys 
760 


Val 


Val 


Gly 


Thr 


Pro 

765 


Ala 


Tyr 


Giu 


Giu 


Met 
770 


Val 


Arg 


Asn 


Cys 


Met 
775 


He 


Gin 


Asp 


Leu 


Ser 
780 


Trp 


Lvs 


Gly 


Pro 


Ala 


Lys 


Asn 


Trp 


Giu 


Asn 


Val 


Leu 


Leu 


Ser 


Leu 


Gly Val 


Ala 


Gly Gly 


785 










790 










795 










800 


Giu 


Pro 


Gly 


Val 


Giu 


Gly 


Giu 


Giu 


lie 


Ala 


Pro 


Leu 


Ala 


Lys 


Giu 


Asn 










805 








810 










815 




Val 


Ala 


Ala 


Pro 
820 



























<210> 17 
<21l> 19 
< 2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

< 223 > synthetic 
<400> 17 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser 



<210> 18 
<2U> 444 
<212> PRT 

<213> Thermotoga maritima 
<400> 18 

Met Ala Giu Phe Phe Pro Giu lie Pro Lys He Gin Phe Giu Gly Lys 

15 10 15 

Giu Ser Thr Asn Pro Leu Ala Phe Arg Phe Tyr Asp Pro Asn Giu Val 
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20 



lie 


Asp 


Gly 


Lys 


Pro 


Leu 


Lys 


Asp 






35 










40 


Trp 


His 


Thr 


Phe 


Val 


Asn 


Glu 


Gly 




50 










55 




Ala 


Glu 


Arg 


Pro 


Trp 


Asn 


Arg 


Phe 


65 










70 






Ala 


Arg 


val 


Asp 


Ala 


Leu 


Phe 


Glu 










85 








Tyr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 


Asp 








100 










Arg 


Glu 


Thr 


Asn 


Lys 


lie 


Leu 


Asp 






115 










120 


Arg 


Met 


Lys 


Asp 


Se r 


Asn 


Val 


Lys 




130 










135 




Phe 


Ser 


His 


Pro 


Arg 


Tvr 


Met 


His 


145 










150 






Asd 


Va 1 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 








1 65 








Thr 


Lys 


Giu 


Leu 


Giy 


Giy 


Glu 


Gly 








ISO 










Glv 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 




195 










200 


Leu 


A 1 a 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 




210 










2)5 




Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


lie 


Glu 


225 










230 






His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 


Ala 










245 








His 


Giy 


Leu 


Asp 


Glu 


Tyr 


Phe 


Lys 








260 










Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 






275 










280 


Leu 


Giy 


Lys 


Leu 


Gly 


Ser 


lie 


Asp 




2 90 










295 




Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


305 










310 






Ala 


Wet 


Tyr 


Glu 


Val 


lie 


Lys 


Ala 








325 








Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arg 


Arg 








340 










Phe 


lie 


Glv 


His 


lie 


Ala 


Gly 


Met 






355 










360 


lie 


Ala 


Tyr 


Lys 


Leu 


Ala 


Lys 


Asp 




370 










375 




Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Lys 


Glu 












390 






G 1 y 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 


Leu 










405 








Asp 


lie 


Glu 


Leu 


Pro 


Ser 


Giy 


Lys 






420 










Asn 


Ser 


Tyr 


He 


val 


Lvs 


Thr 


lie 






435 










4 40 



<210> 19 
<211> 1335 
<2!2> DMA 



25 










30 






His 


Leu 


Lys 


Phe 


Ser 


Val 


Ala 


Phe 








45 








Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Pro 


Thr 








60 










Ser 


Asp 


Pro 


Met 


Asp 


Lys 


Ala 


Phe 






75 










80 


Phe 


Cys 


Glu 


Lys 


Leu 


Asn 


lie 


Glu 




90 










95 




lie 


Ala 


Pro 


Glu 


Gly 


Lys 


Thr 


Leu 


105 










110 






Lys 


Val 


Val 


Glu 


Arg 


He 


Lys 


Glu 








125 








Leu 


Leu 


Trp 


Gly 


Thr 


Ala 


Asn 


Leu 








140 










Gly 


Ala 


Ala 


Thr 


Thr 


Cys 


Ser 


Ala 






155 










160 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 


Glu 


lie 




170 










175 




Tyr 


Val 


Phe 


Trp 


Gly 


Gly 


Arg 


Glu 


185 










190 






Asp 


Leu 


Gly 


Leu 


Glu 


Leu 


Glu 


Asn 










205 








Val 


Glu 


Tyr 


Ala 


Lys 


Lys 


He 


Gly 








220 










Pro 


Lvs 

* 


Pro 


Lys 


Glu 


Pro 


Thr 


Lys 




235 










240 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 


Lys 


Asn 




250 










255 




Phe 


Asn 


lie 


Glu 


Ala 


Asn 


His 


Ala 


265 










270 






His 


Glu 


Leu 


Arg 


Met 


Ala 


Arg 


lie 










285 








Ala 


Asn 


Gin 


Gly 


Asp 


Leu 


Leu 


Leu 








300 










Thr 


Asn 


lie 


Tyr 


Asp 


Thr 


Thr 


Leu 






315 








320 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 


Gly 


Leu 




330 










335 




Ala 


Ser 


Tyr 


Lys 


Val 


Glu 


Asp 


Leu 


345 










350 






Asp 


Thr 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 










365 








Gly 


Val 


Phe 


Asp 


Lys 


Phe 


w -ft 

I le 


Glu 








380 










Gly 


lie 


Gly 


Lys 


Glu 


lie 


Val 


Glu 






395 










400 


Glu 


Glu 


Tyr 


lie 


lie 


Asp 


Lys 


Glu 




410 










415 




Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


Leu 


425 








4 30 






Ala 


Glu 


Leu 


Arg 
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<213> Thermotoga maricima 



<400> 19 

atggccgagt tcttcccgga gatcccgaag atccagttcg agggcaagga gtccaccaac 60 

ccgctcgccc tccgcttcta cgacccgaac gaggtgatcg acggcaagcc gcccaaggac 120 

cacctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 180 

ggcgacccga ccgccgagcg cccgcggaac cgcttctccg acccgatgga caaggccttc 240 

qcccgcctgg acgcectctt cgagttctgc gagaagctca acatcgagta cctccgcttc 300 

cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 

saggtggtgg agcgcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 420 

accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 

qacgtgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 540 

ggcggcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 

gacctcggcc tggagctgga gaaccccgcc cgcttcctcc gcatggccgt ggagtacgcc 660 

aagaagatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 120 

caccaqtacg acttcgacgt ggccaccgcc tacgccttcc tcaagaacca cggcctcgac 780 

gagcacttca c3gttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 

cacgagctcc gcatggcccg caccctcggc aagctcggct ccatcgacgc caaccagggc 900 

qacctcctcc tcggctggga caccgaccag ttcccgacca acatctacga caccaccctc 960 

qccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggcgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 

gacaccttcg ccctcggctt caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac 1140 

aagttcatcg aggagaagta ccgctccttc aaggagggca tcggcaagga gatcgtggag 1200 

ggcaagaccg acttcgagaa gctggaggag tacatcatcg acaaggagga catcgagctg 1260 

ccgtccqgca agcaggagta cctggagtcc ctcctcaact cctacatcgt gaagaccatc 1320 

gccgaqctgc gctga 1335 

< 2 1 0 > 20 
< 2 1 1 > 444 
< 2 1 2 > PRT 

< 2 1 5 > Thermotoga neapolitana 



<400> 20 




























Met 


Ala 


Giu 


Phe 


Phe 


Pro 


Glu 


lie 


Pro 


Lys 


Val 


Gin 


Phe 


Glu 


Gly 


Lys 


1 








5 










10 










15 




Glu 


Ser 


Thr 


Asn 
20 


Pro 


Leu 


Ala 


Phe 


Lys 
25 


Phe 


Tyr 


Asp 


Pro 


Glu 
30 


Glu 


He 


lie 


Asp 


Glv 


Lvs 

* 


Pro 


Leu 


Lys 


Asp 


His 


Leu 


Lys 


Phe 


Ser 


Val 


Ala 


Phe 












40 










45 








Trp 


His 


Thr 


Phe 


Val 


Asn 


Glu 


Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Pro 


Thr 


50 










55 










60 










Ala 


Asp 


Arg 


Pro 


Trp 


Asn 


Arg 


Tyr 


Thr 


Asp 


Pro 


Met 


Asp 


Lys 


Ala 


Phe 


65 






70 










75 










80 


Ala 


Arg 


Val 


Aso 

♦ 


Ala 


Leu 


Phe 


Glu 


Phe 


Cys 


Glu 


Lys 


Leu 


Asn 


He 


Glu 






85 










90 










95 




Tur 
■ / - 


Phe 


Cvs 


Phe 


His 


Asp 


Arg 


Asp 


lie 


Ala 


Pro 


Glu 


Gly 


Lys 


Thr 


Leu 




100 










105 










110 






Arg 


Glu 


Thr 


Asn 


Lys 


lie 


Leu 


Asp 


Lys 


Val 


Val 


Glu 


Arg 


He 


Lys 


Glu 






115 








120 










125 








Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


Ala 


Asn 


Leu 


130 








135 










140 










Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 


His 


Gly 


Ala 


Ala 


Thr 


Thr 


Cys 


Ser 


Ala 


145 








150 










155 










160 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Leu 


Glu 


He 








165 










no 










175 




Thr 


Lys 


G.lu 


Leu 
180 


Gly 


Gly 


Glu 


Gly 


Tyr 
185 


Val 


Phe 


Trp 


Gly 


Gly 
190 


Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


Thr 


Asp 


Leu 


Gly 


Phe 


Glu 


Leu 


Glu 


Asn 
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195 










200 


Leu 


Ala 
210 


Arg 


Phe 


Leu 


Arg 


Met 
215 


Ala 


?he 


Thr 


Gly 


Gin 


Phe 


Leu 


lie 


Giu 


225 










230 






His 


Gin 


Tyr 


Asp 


Phe 
245 


Asp 

• 


Val 


Ala 


His 


Gly 


Leu 


Asp 
260 


Glu 


Tyr 


Phe 


Lys 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 






275 








280 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


lie 


Asp 




290 








295 




G 1 v 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


305 










310 






Al a 


Met 


Tyr 


Glu 


Val 


lie 


Lys 


Ala 








325 








Asn 


Phe 


Asp 


Ala 
340 


Lys 


Val 


Arg 


Arq 


Phe 


lie 


Gly 
355 


His 


lie 


Ala 


Gly 


Met 

360 


Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asp 




3"?0 






375 




Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Arg 


Glu 


385 








390 






Gly 


Lys 


Val 


Asp 


Phe 


Giu 


Lys 


Leu 






405 








Thr 


I ).e 


Glu 


Leu 

420 


Pro 


Ser 


Gly 


Lys 


Asn 


Ser 


Tyr 
435 


lie 


val 


Lys 


Thr 


lie 
440 











205 








Val 


Asp 


Tyr 


Ala 
220 


Lys 


Arg 


lie 


Gly 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 


Thr 


Lys 




235 










240 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 


Lys 


Ser 




250 








255 




Phe 


Asn 


lie 


Glu 


Ala 


Asn 


His 


Ala 


265 










270 






His 


Glu 


Leu 


Arg 


Met 


Ala 


Arg 


lie 








285 








Ala 


Asn 


Gin 


Gly 
300 


Asp 


Leu 


Leu 


Leu 


Thr 


Asn 


Val 


Tyr 


Asp 


Thr 


Thr 


Leu 






315 








320 


Gly 


Gly 


Phe 


Thr 


Lys 


Gly 


Gly 


Leu 


330 










335 




Ala 


Ser 


Tyr 


Lys 


Val 


Glu 


Asp 


Leu 


345 










350 






Asp 


Thr 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 








365 








Gly 


Val 


Leu 


Asp 


Lys 


Phe 


lie 


Glu 






380 










Gly 


lie 


Gly 


Arg 


Asp 


lie 


Val 


Glu 




395 










400 


Glu 


Glu 


Tyr 


lie 


He 


Asp 


Lys 


Glu 




410 








415 




Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


lie 


425 








430 






Leu 


Glu 


Leu 


Arg 











<210> 21 
<211> 1335 
<212> DNA 

<213> Thermotoga neapolitana 

<400> 21 

atggccgagt tcttcccgga gatcccgaag gtgcagttcg agggcaagga gtccaccaac 

ccgctcacct tcaagttcta cgacccggag gagatcatcg acggcaagcc gctcaaggac 

racctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 

ggcgacccga ccgccgaccg cccgtggaac cgctacaccg acccgatgga caaggccttc 240 

gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 

racoaccqcq acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 

aaggtgctgq aqcqcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 420 

Qr-cqccaacc cctcccccca cccccgctac atgcacggcg ccgccaccac ctgctccgcc 480 



60 

120 

180 



gacctgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 540 
caccagtacg acctcgacgt ggccaccgcc tacgccttcc tcaagtccca cggcctcgac 780 



ggccgcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 
aacctcggct tcgagctgga gaacctcgcc cgcttcctcc gcatggccgt ggactacgcc 
aagcgcatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 7^0 



660 



840 



gagcacttca agctcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 

racoaqczqz acatagcccg catcctcggc aagctcggct ccatcgacgc caaccagggc 900 

gacctcctcc tcggctggga caccgaccag ttcccgacca acgtgtacga caccaccctc 960 

gccacgtacg agatgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggcgcgcc gcgcctccta caaggtggag gacctcttca ccggccacat cgccggcatg 1080 

gacacctccg ccctcggctt caaggtggcc tacaagctcg cgaaggacgg cgtgctcgac 1140 
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aagcccaicg aqgagaagta ccgctccttc cgcgagggca tcggccgcga catcgtggag 1200 

ggcaaggcgg acttcgagaa gctggaggaq lacatcatcg acaaggagac catcgagctg 1260 

ccqtccggca agcagqagta cctggagtcc ctcaccaact cctacatcgt gaagaccatc 1320 

ctggagccqc gctga 1335 

<210> 22 

<2\l> 28 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

< 2 2 3 > synthetic 

<A00> 22 

agcgaattca tggcggctct ggccacgt 28 

<210> 23 

<21 1 > 29 

<2 1 2 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 

<400> 23 

agctaagctt cagggcgcgg ccacgttct 29 

<210> 24 

<2il> 325 

< 2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

<223> synthetic 



<400> 24 



Met 


Arg 


Va i 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 








5 










10 










15 




Ala 


Thr 


Ser 


Ala 


Gly 


His 


Trp 


Tyr 


Lys 


His 


Gin 


Arg 


Ala 


Tyr 


Gin 


Phe 








20 








25 










30 






Thr 


31 y 


Glu 


Asp 


Asp 


Phe 


Gly 


Lys 


Val 


Ala 


Val 


Val 


Lys 


Leu 


Pro 


Met 




35 










40 










45 








Asp 


Leu 


Thr 


Lys 


Val 


Gly 


lie 


lie 


Val 


Arg 


Leu 


Asn 


Glu 


Trp 


Gin 


Ala 


50 










55 










60 










Lvs 
65 


Asp 

* 


Val 


Ala 


Lys 


Asp 


Arg 


Phe 


lie 


Glu 


lie 


Lys 


Asp Gly 


Lys 


Ala 






70 










75 










80 


Glu 


Val 


Trp 


lie 


Leu 


Gin 


Gly 


Val 


Glu 


Glu 


lie 


Phe 


Tyr 


Glu 


Lys 


Pro 








85 










90 










95 




Asp 


Thr 


Ser 


Pro 


Arg 


lie 


Phe 


Phe 


Ala 


Gin 


Ala 


Arg 


Ser 


Asn 


Lys 


Val 






100 








105 










110 






lie 


Glu 


Ala 
115 


Phe 


Leu 


Thr 


Asn 


Pro 
120 


Val 


Asp 


Thr 


Lys 


Lys 
125 


Lys 


Glu 


Leu 


Phe 


Lvs 


Val 


Thr 


Val 


Asp 


Gly 


Lys 


Glu 


lie 


Pro 


Val 


Ser 


Arg 


Val 


Glu 




130 








135 










140 










Lys 


Ala 


Asp 


Pro 


Thr 


Asp 


lie 


Asp 


Val 


Thr 


Asn 


Tyr 


Val 


Arg 


lie 


Val 


145 








150 










155 










160 


Leu 


Se r 


Glu 


Ser 


Leu 
165 


Lys 


Glu 


Glu 


Asp 


Leu 

no 


Arg 


Lys 


Asp 


Val 


Glu 
175 


Leu 
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lie 


lie 


Glu 


Gly 


Tyr 


Lys 


Pro 


Ala 








180 










Asp 


Asp 


Tyr 


Tyr 


Tyr 


Asp 


Gly 


Glu 






195 










AAA 

200 


Lys 


Thr 


Me 


Phe 


Arg 


Val 


Trp 


Ser 


210 








215 




Leu 


Leu 


Phe 


Lys 


Asn 


Gly 


Glu 


Asp 


225 










230 






Me c 


Glu 


Tyr 


Lys 


Gly 


Asn 


ol y 


Va 1 










245 








Leu 


Asp 


GJy 


Val 


Phe 


Tyr 


Leu 


Tyr 








260 










Arc 


Thr 


Thr 


Va I 


Asp 


Pro 


Tyr 


Ser 




275 










280 


Glu 


Ser 


Ala 


Val 


Val 


Asn 


Leu 


Ala 




290 










295 




Asn 


Asp 


Arg 


Gly 


Pro 


Lys 


lie 


Glu 


305 










310 






Glu 


lie 


His 


lie 


Ala 


Asp 


He 


Thr 










325 








Asn 


Lys 


Gly 


Leu 


Tyr 


Leu 


Gly 


Leu 








34 0 










Gly Glv 


Val 


Thr 


Thr 


Gly 


Leu 


Ser 






355 










360 


His 


Val 


His 


lie 


Leu 


Pro 


Phe 


Phe 




370 










375 




Asp 


Lys 


Asp 


Phe 


Glu 


Lys 


Tyr 


Tyr 


365 










390 






Phe 


Met 


Val 


Pro 


Glu 


Gly 


Arg 


Tyr 










4 05 








Thr 


Arg 


He 


Arg 


Glu 


Val 


Lys 


Glu 








420 










Glv 


He 


Gly 


Val 


lie 


Met 


Asp 


Met 




4 35 










4 40 


Gly 


Glu 


Leu 


Ser 


Ala 


Phe 


Asp 


Gin 


4 50 










4 55 




lie 


Asp 


Lys 

4 


Thr 


Gly 


Ala 


Tyr 


Leu 


4 65 










4 70 






lie 


Ala 


Ser 


Glu 


Arg 


Pro 


Met 


Met 










4 85 








Thr 


Tyr 


Tr d 


Va i 


Lys 


Glu 


Tyr 


His 






500 










Met 


Gly 


Leu 


1 1 e 


Asp 


Lys 


Lys 


Thr 






51 5 










520 


His 


Lys 


He 


Asp 


Pro 


Thr 


He 


He 




530 










535 




Trp 


G 1. y 


A - a 


Fro 


lie 


Arg 


Phe 


Gly 


54 5 










550 






Val 


Ala 


Ala 


Phe 


Asn 


Asp 


Glu 


Phe 










565 








Phe 


Asn 


Pro 


Ser 


Val 


Lys 


Gly 


Phe 








580 










Thr 


Lys 


lie 


Lys 


Arg 


Gly 


Val 


Val 






595 










600 


Leu 


lie 


Lys 


Ser 


Phe 


Ala 


Leu 


Asp 




610 








615 




Ala 


Cys 


His 


Asp 


Asn 


His 


Thr 


Leu 


625 








630 







Arg 


val 


lie 


Met 


Met 


Glu 


He 


Leu 


185 










190 






Leu 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Glu 










205 








Pro 


Val 


Ser 


Lys 


Trp 


Val 


Lys 


val 








a a 

220 










Thr 


Glu 


Pro 


Tyr 


Gin 


Val 


Va 1 


Asn 






235 








2 4 0 


Trp 


Glu 


Ala 


Va 1 


Val 


Glu 


Gly 


Asp 




250 










255 




Gin 


Leu 


Glu 


Asn 


Tyr 


Gly 


Lys 


lie 


265 










270 






Lys 


Ala 


Val 


Tyr 


Ala 


Asn 


Asn 


Gin 








285 








Arg 


Thr 


Asn 


Pro 


Glu 


Gly 


Trp 


Glu 








300 










Gly 


Tyr 


Glu 


Asp 


Ala 


lie 


He 


Tyr 






315 










320 


Gly 


Leu 


Glu 


Asn 


Ser 


Gly 


Val 


T • 1 M 

Lys 




330 










335 




Thr 


Glu 


Glu 


Asn 


Thr 


* 

Lys 


Ala 


Pro 


345 










350 






His 


Leu 


Val 


Glu 


Leu 


Gly 


Val 


Thr 










365 








Asp 


Phe 


Tyr 


Thr 


Gly 


Asp 


Glu 


Leu 








380 










Asn 


Trp 


Gly 


JVM 

Tyr 


* 

Asp 


Pro 


Tyr 


Leu 






395 










400 


Ser 


Thr 


ft 

Asp 


Pro 


Lys 


Asn 


Pro 


His 




4 10 










415 




Met 


Val 


Lys 


Ala 


Leu 


His 


Lys 


HIS 


4 25 










4 30 






Va 1 


Phe 


Pro 


MIS 


l nr 


Tyr 


vji y 


Tie* 

lie 










4 4 5 








Thr 


Val 


Pro 


Tyr 


Tyr 


Pne 


Tyr 


Arg 








4 60 










Asn 


Glu 


Se r 


Gly 


Cys 


Gly 


Asn 


va 1 






475 










480 


Arg 


* 

Lys 


Phe 


lie 


Va 1 


Asp 


Thr 


va l 


4 90 










/IOC 

4 95 




l le 


Asp 


Gly 


Phe 


Arg 


Pne 


ASp 


bin 


505 










510 






Met 


Leu 


Glu 


va 1 


Glu 


Arg 


ai a 


Le u 










525 








Leu 


Tyr 


Gly 


Glu 


Pro 


Trp 


Gly 


Gly 








540 










Lys 


Ser 


Asp 


Va 1 


Ala 


Gly 


Thr 


His 






555 










560 


Arg 


Asp 


Ala 


He 


Arg 


Gly 


Ser 


Val 




570 










575 




Val 


Met 


Gly 


Gly 


Tyr 


Gly 


Lys 


Glu 


595 










590 






Gly 


Ser 


He 


Asn 


Tyr 


Asp 


Gly 


Lys 










605 








Pro 


Glu 


Glu 


Thr 


He 


Asn 


Tyr 


Ala 








620 










Trp 


Asp 


Lys 


Asn 


Tyr 


Leu 


Ala 


Ala 






635 










640 
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Lys 


Ala 


Asp 


Lys 


Lys 

6'; 5 


Lvs 

■* 


bl u 


1 ro 

* 


Gin 


Lys 


Leu 


Ala 
660 


Gly 


Ala 


T 1 ^ 

lie 


Leu 


Leu 


His 


Gi y 
o7 5 


Giy 


Gin 


Asp 


n i-> 

Phe 


Lys 

DOU 


Ser 


Tyr 
690 


Asn 


Ala 


Pro 


I le 


Sex 
695 


i le 


Leu 


G! n 


Phe 


lie 


Asp 


Va 1 


Phe 


as n 


7 05 










■J i a 
i 1 0 






Arg 


Lvs 


Glu 


His 


Pro 


Al a 


Phe 


Arg 






7 2 5 








Lys 


His 


Leu 


Glu 


r-\ 1 

Phe 


Leu 


Pro 


Gly 






7 4 0 










Leu 


Lys 


Asp 


HIS 


Aia 


bi y 


Ijiy 


n5 p 




755 










760 


Tyr 


Asn 


Gly 


Asn 


Leu 


Glu 


Lys 


Thr 


770 










775 




Trp Asn 


val 


Val 


Val 


Asn 


Ser 


Gin 


735 










790 






Thr 


Val 


Glu 


Gl y 


Thr 


lie 


Glu 


Leu 








605 








Tvr 


Arg 


Glu 


Ser 


Glu 


Lys 


Asp 


Glu 



820 



Thr 


r 1 H 
b 1 U 

CCA 


r i it 
bl U 


pin 

bl U 


Leu 


Lys 


Hen 
Moll 

^ Ci 
v J J 


r\l a 


Leu 


Thr 


Ser 


Gin 


Gly 


Val 


Pro 


Phe 


665 
















ai g 


Til y 

l n r 


i n r 


T\ r\ 

as n 


r ne 


as n 




Hen 








cot 

DO O 








Asn 


Gly 


Phe 


Asp 

~i A A 

700 


Tyr 


Glu 


Arg 


Lys 


Tyr 


His 


Lys 


Gly 


Leu 


lie 


Lys 


Leu 




715 












Leu 


Lys 

T "J A 

7 30 


Asn 


Al a 


blU 


blU 


lie 

1 "3 C. 


Lys 


Gly 


Arg 


Arg 


T 1 ^ 

lie 


val 


Ala 


r ne 


we t 


74 5 










"1 C A 

7 50 






Pro 


Trp 


Lys 


Asp 


lie 

765 


Val 


Val 


He 


Thr 


Tyr 


Lys 


Leu 
780 


Pro 


Glu 


Gly 


Lys 


Lys 


Ala 


Gly 


Thr 


Glu 


Val 


lie 


Glu 




795 










800 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


val 


Leu 


810 










815 





Leu 
825 



60 



<2 10> 25 
<21!> 2^78 
< 2 1 2> DMA 

< 2 1 3 > Artificial Sequence 
<220> 

< 2 2 3 > synthetic 
<400> 25 

atgagagtgt tgcticgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 

ggccacuggc acaagcacca gcgcgcctac cagttcaccg gegaggaega ettegggaag 120 

gtggccg'-gg tgaagctccc gatggacctc accaaggtgg geatcategt gcgcctcaac 180 

gageggcagg egaaggaegt ggecaaggae cgcttcatcg agatcaagga cggcaaggcc 240 

gaggcgtgga tactccaggg cgtggaggag atcttctacg agaagcegga cacctccccg 300 

cgcaccttct tcgcccaggc ccgctccaac aaggtgatcg aggccttcct caccaacccg 360 

acqgacacca agaagaagga gctgttcaag gtgacegteg aeggcaagga gatcceggtg 420 

tcccgcgtgg agaaggcega cccgaccgac atcgacgtga ccaactacgt gcgcatcgtg 480 

ctccccgagt ccctcaagga ggaggacctc cgcaaggacg tggagctgat catcgagggc 540 

tacaagcegg cccgcgtgac catgatggag atcctcgacg actactacta egaeggegag 600 

ctgggggcgg tgtactcccc ggagaagacc atcttccgcg tgtggtcccc ggtgtccaag 660 

taggtgaaag tgctcctcte caagaaegge gaggacaccg agccgtacca ggtggtgaac 720 

atcgaataca agggcaaegg cgtgtgggag gccgtggtgg agggegaect egaeggegtg 780 

ctctacctct accagctgga gaactaegge aagatccgea ccaccgtgga cccgtactcc 840 

aaggccgtge acgccaacaa ccagcagtct gcagtggtga acctcgcccg caccaacccg 900 

gagggctggg agaacgaccg cggcccgaag atcgagggct aegaggaege catcatctac 960 

gagatccaca tcgccgacat caccggcctg gagaactccg gcgtgaagaa caagggcctc 1020 

tacctcggcc ccaccgagga gaacaccaag gccccgggcg gcgtgaccac cggcctctcc 1080 

caccccacgg agctgggcgt gacccacgcg cacatcctcc cgttcttcga cttctacacc 1140 

ggegacgage eggacaagga ctccgagaag tactacaacc ggggctacga cccgtacctc 1200 

ttcatggtgc eggagggecg ccaccccacc gacccgaaga acccgcacac ccgaattcgc 1260 

aaggegaagg agatggtcaa ggccctccac aagcaeggea teggegtgat catggacacg 1320 

gugctcccgc acacctacgg categgegag ctgtccgcct tcgaccagac cgtgccgtac 1380 
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cacttctacc gcatcgacaa gaccggcgcc cacctcaacg agtccggctg cggcaacgtg 1440 

atcgcctcco agcgcccgat gatgcgcaag ttcatcgtgg acaccgtgac ctactgggtg 1500 

aaggagtecc acatcgacgg cctccgcttc gaccagatgg gcctcatcga caagaagacc 1560 

atgctggaag tggagcgcgc cctccacaag atcgacccga ccatcatcct ctacggcgag 1620 

ccgnggggcg gccggggggc cccgatccgc ttcggcaagt ccgacgtggc cggcacccac 1680 

gtgqccgcct tcaacgacga gttccgcgac gccacccgcg gctccgtgtt caacccgtcc 1740 

gtgaagggct tcgtgacggg cggctacggc aaggagacca agatcaagcg cggcgtggtg 1800 

gqctccatca actacgacgg caagctcatc aagtccttcg ccctcgaccc ggaggagacc 1860 

accaactacg ccgcctgcca cgacaaccac accctctggg acaagaacta cctcgccgcc 1920 

aaggccgaca agaagaagga gtggaccgag gaggagctga agaacgccca gaagctcgcc 1980 

ggcgccatcc tcctcaccag tcagggcgtg ccgttcctcc acggcggcca ggacttctgc 2040 

cccaccacca acttcaacga caactcctac aacgccccga tctccatcaa cggcttcgac 2100 

tacgagcgca agccccagtc catcgacgtg ttcaactacc acaagggcct catcaagctc 2160 

cgcaaggagc acccggcctc ccgcctcaag aacgccgagg agatcaagaa gcacctggag 2220 

tccctcccgg gcgggcgccg catcgtggcc ttcatgctca aggaccacgc cggcggcgac 2280 

ccgtggaagg acatcguggt gatctacaac ggcaacctgg agaagaccac ctacaagctc 2340 

ccggagggca agtggaacgt qgtggtgaac tcccagaagg ccggcaccga ggtgatcgag 2400 

accgtggagg gcaccatcga gctggacccg ccctccgcct acgtgctcta ccgcgagtcc 2460 

gagaaggacg agctgtga 2478 

<210> 26 
< 2 1 1 > 713 
<2\2> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 26 



Met 


Arg 


Val 


Leu 


Leu 


Vai 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


1 






5 










10 










15 




Ala 


Thr 


Ser 


Met 
20 


Glu 


Thr 


He 


Lys 


lie 

25 


Tyr 


Glu 


Asn 


Lys 


Gly 
30 


Val 


Tyr 


Lys 


Val 


va l 


lie 


Gly 


Glu 


Pre 


Phe 


Pro 


Pro 


He 


Glu 


Phe 


Pro 


Leu 


Glu 






35 








40 










45 








Gin 


Lvs 
50 


lie 


Ser 


Ser 


Asn 


Lys 

5b 


Ser 


Leu 


Ser 


Glu 


Leu 
60 


Gly 


Leu 


Thr 


He 


Val 


Gin 


Gin 


Gly 


Asn 


Lvs 


Val 


lie 


Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 


65 










7 0 










75 










80 


Glu 


His 


lie 


lie 


Gly 
85 


Leu 


Gly 


Glu 


Lys 


Ala 

90 


Phe 


Glu 


Leu 


Asp 


Arg 
95 


Lys 


Arg 


Lys 


Arg 


Tyr 


Val 


Met 


Tyr 


Asn 


Val 


Asp Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 




100 










105 










110 






Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Val 


Ser 


He 


Pro 


Leu 


Phe 


He 


Ser 


Val 


Lys 




115 








120 










125 








Asp 


Gly 


Val 


Ala 


Thr 


Gly 


Tyr 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


lie 


130 










135 










140 










Phe 


Asp 


Val 


Gly 


Leu 


Glu 


Giu 


Tyr 


Asp 


Lys 


Val 


He 


Val 


Thr 


He 


Pro 


14 5 






150 










155 










160 


Glu 


Asp 


Ser 


Vai 


Glu 


Phe 


Tyr 


Val 


lie Glu Gly 


Pro 


Arg 


He 


Glu 


Asp 








165 










170 










175 




Val 


Leu 


Glu 


Lys 


Tyr 


Thr 


Giu 


Leu 


Thr 


Gly 


Lys 


Pro 


Phe 


Leu 


Pro 


Pro 








180 








185 










190 






Met 


Trp Ala 


Phe 


Gly 


Tyr 


Met 


lie 


Ser 


Arg 


Tyr 


Ser 


Tyr 


Tyr 


Pro 


Gin 






195 










200 










205 








Asp 


Lys 


Val 


Val 


Glu 


Leu 


Val 


Asp 


He 


Met 


Gin 


Lys 


Glu 


Gly 


Phe 


Arg 


210 










215 










220 










Vai 


Ala 


Gly 


Val 


Phe 


Leu 


Asp 


lie 


His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 
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225 230 



r> u ^ 

Fhe 


Th r 


Trp 


His 


Pro 


Tyr 


Arg 


Phe 


















Glu 


* 

Leu 


His 


* 

Lys 


Arg 


Asn 


Val 


Lys 








260 










lie 


Arg 


Va 1 


Aso 

• 


G 1 n 


Asn 


Tyr 


Ser 




j "i n c 

^ / 5 










1 O A 

280 


f-ne 


Cys 


Glu 


T 1 e 


Giu 


Ser 


Gly 


Glu 




290 










295 




Gly 


Thr 


Thr 


Va 1 


Tvr 


Pro 


Asp 


Phe 


.>0d 










310 






Ttd 

* 


Ala 


Gl v 

• 


Leu 


lie 


Ser 


Glu 


Trp 








325 








Tro 


Leu 


Asp 


Met 


Asn 


Glu 


Pro 


Thr 








34 0 










A t cj 


Asp 


Va 1 


Leu 


Se r 


Se r 


Leu 


Pro 


355 










O i*A 

360 


Va 1 


Thr 


Thr 


Fhe 


Pro 


Asp 


Asn 


Va 1 




370 










37 5 




Va 1 


Lys 


His 


Giu 


Lys 


7a 1 


Arg 


Art . 

Asn 


38 5 










"5 A A 






A 1 a 


Thr 


Fhe 


Lys 


Gly 


Phe 


Arg 


TIL „ 

Thr 










4 05 








Leu 


Ser 


Ara 


Ala 


Gl y 


Tyr 


Ala 


Gly 








420 










1 n r 


G 1 v 


Aso Asn 

* 


in r 


Fro 


Ser 


Trp 




135 










/I .1 A 

4 4 U 


Leu 


Va 1 


Leu 


Gly 


Leu 


Ser 


lie 


O «. I_V 

Ser 




i c a 

4 r>0 










/ICC 

4 b b 




I 1.G 


<o 1 y 


G 1 y 


Phe 


Gin 


Gl y 


Arg 


As n 


4 b5 










4 / U 






Asp 


Leu 


Leu 


Val 


Lys 


1 yr 


Tyr 


TV 1 . 

Ai a 


















Ara 


Se r 


His 


Lys 


Mia 


Thr 


Asp 


Gly 








500 










Pro 


Asp 


Tyr 


Tyr 


Lys 


Glu 


Lys 


va i 






51 5 










C T A 


Lys 


Fne 


Leu 


Fro 


Tyr 


He 


Tyr 


Ser 


530 










c o c 

5 35 




Gly 


His 


Pro 


Val 


He 


Arg 


Pro 


Leu 


54 5 










CCA 

b bu 






Asp 


Wet 


Tyr 


Arg 


lie 


Glu 


Asp 


GIU 










c £ c 

565 








Tyr 


Ala 


Fro 


He 


Val 


Ser 


Lys 


Glu 






560 










Arg 


Gly 


Lys 


Tro 

i 


Tvr 


A s n 


IT* 

! yr 


Trp 














oOO 


Ser 


Va l 


va i 


Lvs 

- 


Ser 


1 nr 


His 


Glu 




610 








6 1 5 






: i e 


Tie 


Pro 


Leu 


Glu 


Gly 


Asp 


625 










630 






Se r 


Phe 


Lys 


Arc 


Tyr 


Asp 


Asn 


Ala 










64 5 








I le 


Lys 


Phe 


Ser 


Arg 


Glu 


lie 


Tyr 








660 










Glu 


Lys 


Pro 


Val 


Ser 


Lys 


lie 


lie 






675 










680 


Val 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 







2 35 










0 A A 

2 4 0 


D *~ ^ 

r ro 


(jl U 


D W* ^K. 

Pro 


Lys 


Lys 


T All 

Leu 


lie 


MSp 




OCA 

250 










T C C 

2 bb 




Leu 


T 1 Q 

i le 


i nr 


lie 


va i 


MSp 


n 1 S 


gi y 


T C C 

<i 00 
















n w 

Fro 


Phe 


Leu 


Ser 


Gly 


Met 


Gly 


Lys 










o o c 








Leu 


Pne 


va l 


Gly 


Lys 


riec 


i rp 


Fro 








"} A A 

300 










r ne 


Arg 


GIU 


Asp 


T W v 

i nr 


Mrg 


GIU 


Trp 






3 1 5 










T A A 

320 


Leu 


Ser 


Gin 


Gly 


Val 


Asp 


Gly 


I le 




330 










335 




Asp 


Phe 


Ser 


Arg 


Ala 


lie 


Glu 


lie 


34 5 










350 






Va 1 


Gin 


Phe 


Arg 


Asp 


Asp 


Arg 


Leu 










o e c 

365 








val 


his 


Tyr 


Leu 


Mrg 


g i y 


Lys 


A v n 

Mrg 








380 










Ala 


Tyr 


O v> r^. 

Fro 


Leu 


Tyr 


GIU 


ai a 


ne l 






j y b 










yl A A 


Ser 


his 


Arg 


Asn 


Glu 


lie 


Pne 


T 1 r\ 

lie 




A 1 A 

4 1 U 










A 1 C 

4 1b 




Tie 


Gin 


Arg 


Tyr 


Ala 


Phe 


T 1 

lie 


Trp 


/IOC 

4 2 5 










/ "3 A 
4 JU 






Asp 


Asp 


Leu 


Lys 


Leu 


Gin 


Leu 


Gin 










A A C 

4 4b 








o l y 


va i 


D 

Fro 


Fne 


vai 


gi y 


l y s 


MSp 








4 bU 










pne 


Ala 


(jl U 


i ie 


MSp 


Asn 


C 1 v 

ber 


ne l 






A 1 C. 










4 q n 
sou 


Le u 


Mia 


I Ail 

Leu 


r ne 


rne 




rne 


i yr 




/I OA 

4 rU 










4 ?b 




T 1 o 

lie 


HSp 


i nr 


o 1 u 


r ro 


va i 


rne 


Leu 










CIA 

D 1 U 






Lys 


GlU 


lie 


Va 1 


GlU 


Leu 


Arg 


Tyr 










b<£ D 








Leu 


Ala 


Leu 


Vj 1 U 


ai a 


oer 


pin 

GIU 


Lys 








C A A 

D 4 U 










pne 


Tyr 


Glu 


Fne 


Gin 


MSp 


ASp 


MS p 






555 










CCA 

b bU 


Tyr 


Met 


Val 


Gly 


Lys 


Tyr 


Leu 


T All 

Leu 




570 










b / b 




Glu 


Ser 


Arg 


Leu 


Va 1 


Thr 


Leu 


Fro 


C 3 C 

55 5 










C AA 






Asn 


Gly Glu 


lie 


He 


Asn 


Gly 


Lys 










/"AC 

605 








Leu 


Pro 


lie 


Tyr 


Leu 


Arg 


GlU 


Gly 








620 










G 1 U 


Leu 


lie 


va i 


Tyr 


Giy 


GIU 


TK r- 

i n r 






635 










640 


Glu 


Tie 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 




650 










655 




Val 


Ser 


Lys 


Leu 


Thr 


He 


Thr 


Ser 


665 










670 






Val 


Asp Asp 


Ser 


Lys 


Glu 


He 


Gin 










685 








Tyr 


Val 


Ala 


Lys 


He 


Asn 


Gin 


Lys 
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690 695 700 

He Arg Gly Lys Tie Asn Leu Glu Ser Glu Lys Asp Glu Leu 
70S ' 7 10 715 



<210> 27 

< 2 1 1 > 712 

<212> PRT 

< 2 1 3 > Artificial Sequence 
<220> 

<: 2 2 3 > synthetic 



<4C0> 27 



Me t 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


i 

A 








5 










10 










15 




Ala 


Thr 


Ser 


Met 
20 


Glu 


Thr 


He 


Lys 


He 
25 


Tyr 


Glu 


Asn 


Lys 


Gly 
30 


Val 


Tyr 


Lys 


val 


Val 


He 


Gly 


Glu 


Pro 


Phe 


Pro 


Pro 


lie 


Glu 


Phe 


Pro 


Leu 


Glu 






35 








40 










45 








Gin 


Lys 
50 


lie 


Ser 


Ser 


Asn 


Lys 
55 


Ser 


Leu 


Ser 


Glu 


Leu 
60 


Gly 


Leu 


Thr 


He 


Val 


Gin 


Gin 


Gly 


Asn 


Lys 


Val 


He 


Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 


65 








70 










75 










80 


Glu 


His 


lie 


He 


Gly 
35 


Leu 


Gly 


Glu 


Lys 


Ala 

90 


Phe 


Glu 


Leu 


Asp 


Arg 

95 


Lys 


Arg 


Lvs 

• 


Arg 


Tyr 


Val 


Met 


T y r 


Asn 


Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 


100 










105 










110 






Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Va 1 


Ser 


He 


Pro 


Leu 


Phe 


He 


Ser 


Val 


Lys 






115 










120 










125 








Asp 


Gly 


Val 


Ala 


Thr 


Gly 


Tyr 


Phe 


Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


He 


3 30 








135 










140 










Fhe 


Asp 


Val 


Gly 


Leu 


Giu 


Glu 


Tyr 


Asp 


Lys 


Val 


He 


Val 


Thr 


lie 


Pro 


145 






150 










155 










160 


Glu 


Asp 


Ser 


Val 


Glu 


Phe 


Tyr 


Val 


He 


Glu 


Gly 


Pro 


Arg 


lie 


Glu 


Asp 








165 










170 










175 




Val 


Leu 


Glu 


Lys 


Tyr 


Thr 


Glu 


Leu 


Thr 


Gly 


Lys 


Pro 


Phe 


Leu 


Pro 


Pro 








180 








185 










190 






Met 


Trp Ala 


Phe 


Gly 


Tyr 


Met 


He 


Ser 


Arg 


Tyr 


Ser 


Tyr 


Tyr 


Pro 


Gin 






195 










200 










205 








Asp 


Lvs 


Val 


Val 


Glu 


Leu 


Val 


Asp 


He 


Met 


Gin 


Lys 


Glu 


Gly 


Phe 


Arg 


210 










215 










220 










Va 1 


Ala 


Gly 


Val 


Phe 


Leu 


Asp 


He 


His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 


225 








230 










235 










240 


?he 


Thr 


Trp 


His 


Pro 


Tyr 


Arg 


Phe 


Pro 


Glu 


Pro 


Lys 


Lys 


Leu 


He 


Asp 








245 








250 










255 




GIu 


Leu 


His 


Lvs 


nl G 


Asn 


Val 


Lys 


Leu 


He 


Thr 


He 


Val 


Asp 


His 


Gly 






260 








265 










270 






He 


Arc 


Val 


Asp 


Gin 


Asn 


Tyr 


Ser 


Pro 


Phe 


Leu 


Ser 


Gly 


Met 


Gly 


Lys 




275 








280 










285 








Phe 


Cys 
290 


Glu 


lie 


Glu 


Ser 


Gly 
295 


Glu 


Leu 


Phe 


Val 


Gly 
300 


Lys 


Met 


Trp 


Pro 


Gly 


Thr 


Thr 


Val 


Tyr 


Pro 


Asp 


Phe 


Phe 


Arg 


Glu 


Asp 


Thr 


Arg 


Glu 


Trp 


305 








310 










315 










320 


Tro 

* 


Ala 


Gly 


Leu 


lie 


Ser 


Glu 


Trp 


Leu 


Ser 


Gin 


Gly 


Val 


Asp 


Gly 


He 








325 










330 










335 




Trp 


Leu 


Asp 


Met 


Asn 


Glu 


Pro 


Thr 


Asp 


Phe 


Ser 


Arg 


Ala 


He 


Glu 


He 






340 










345 










350 
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Ara 


Asp 


Val 


* 

Leu 


Se r 


Ser 


Leu 


D V* JF-V 

r r 0 


v a i 




355 










J oU 




it y 

Va 1 


Thr 


Thr 


Phe 


Pro 


TV ™ 

Asp 


Asn 


va l 


v a i 




370 










3 / b 






Val 


Lys 


His 


Glu 


Lys 


Val 


Arg 


Asn 


Ala 


St 0 0 








390 








Ala 


Thr 


Phe 


Lys 


Gly 


Phe 


Arg 


Thr 


be r 










4 05 










Leu 


Ser 


Arg 


Ala 


Gly 


Tyr 


Ala 


Gly 


Tl- 
116 






4 20 










4 b 


Thr 


Gly 


Asp 


Asn 


Thr 


Pro 


Se r 


Trp 


ASp 




4 3 5 










J d A 
4 H U 




Leu 


Val 


Leu 


G 1 y 


Leu 


Ser 


lie 


c -» ^ 

Ser 


Giy 




450 










4 b b 






lie 


Gl v 


Gly 


Phe 


Gin 


G 1 y 


Arg 


as n 


r ne 


4 65 










4 /U 








Asp 


Leu 


Leu 


Val 


Lys 


Tyr 


Tyr 


Mia 


ue u 








4 8 5 










Arg 


Ser 


His 


Lys 


Ala 


Thr 


Asp 


Gly 


T 1 £S 

lie 






500 










bU b 


Pre 


Asp 


Tyr 


Tyr 


Lys 


Glu 


Lys 


val 


Lys 






r i c 

bib 














i.ys 


Phe 


Leu 


Pro 


Tyr 


lie 


Tyr 


O r-i s~ 


T oil 

Leu 


5^0 










*3 c. 
j J D 






Gly 


His 


Pro 


Va 1 


He 


Arg 


fro 


Leu 


rile 


5-15 










CCA 

0 DU 








Asp 


Met 


Tyr 


Ara 

> 


lie 


G 1 U 


MS p 


Tin 


i yr 


















Tyr 


Ala 


Pro 


I le 


Va 1 


Ser 


Lys 


GlU 


GlU 








580 










DOD 


Arg 


Gly 


Lys 


Trp 


Tyr 


Asn 


Tyr 


Trp 


Asn 


595 










ouu 




Ser 


Val 


Val 


Lys 


Ser 


Thr 


His 


Pill 

Glu 


Leu 




610 










o 1 b 






Ser 


lie 


He 


Pro 


Leu 


GlU 


Gi y 


MSp 




625 










0 JU 








Ser 


Phe 


Lys 


Arg 


Tyr 


ASp 


ms n 


Mid 










645 










lie 


Lys 


Phe 


Ser 


Arg 


GlU 


lie 


Tyr 


Val 






660 










665 


G3u 


Lys 


Pro 


Va 1 


Ser 


Lys 


lie 


lie 


Val 




675 










680 




Va] 


Glu 


Lys 


Thr 


Met 


Gin 


Asn 


Thr 


Tyr 




690 








695 






lie 


Arg 


Gly 


Lys 


lie 


Asn 


Leu 


Glu 




70S 






710 









<210> 23 
< 2 1 1 > 469 
< 2 i 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

<223> synthetic 
<400> 2S 

Met Arg Val Leu Leu Val Ala Leu Ala 



PCT/US2004/007182 



Gin 


Phe 


Arg 


Asp 


ASp 


m rg 


1 Oil 

Lc U 








JD J 








HIS 


Tyr 


Leu 


Arg 


g l y 


uy 5 


mx g 
















Tyr 


Pro 


T a . • 

Leu 


Tyr 


GlU 


Mia 


rje t 


395 










HXJKJ 


His 


Arg 


Asn 


Glu 


T 1 « 

lie 


nu Q 

r ne 


lie 


4 10 








4lb 




Gin 


Arg 


Tyr 


B 1 _ 

Ala 


Phe 


lie 


Trp 










4 JU 






Asp 


Leu 


Lys 


Leu 


Gin 


Leu 


Gin 








/ yl C 
H H b 








Va 1 


Pro 


Phe 


val 


g l y 


uys 


M5 p 
















Ala 


GlU 


T 1 n 

lie 


ASp 


ms n 


v)Cl 


Mo t- 




4 7b 










d Rn 


n 1 _ 

Ala 


Leu 


Phe 


rhe 


D v 

riO 


rne 


T \f r* 

i yr 


4 9U 










H ~ J 




Asp 


i n r 


GlU 


r r o 


Vol 


nic 


T o i > 








D 1 U 






Glu 


lie 


Val 


GlU 


Leu 


Mr g 


i yr 
















Ala 


Leu 


GlU 


Ala 


be r 


flu 


Li y 9 






^ d n 










Tyr 


Glu 


r ne 


bin 


M 5p 






c. c c 

b b b 












Me t 


va i 


g i y 


Lys 


i yr 


Ton 

LC Li 


1 .P LI 


b/0 














Ser 


Arg 


Leu 


v a i 


i nr 


be U 


Pro 








con 

D?U 






Gly 


Glu 


I le 


lie 


T\. F^V 

Asn 


Gl y 


Lys 






fin ^ 








Pro 


lie 


Tyr 


T AM 

Leu 


Arg 


GlU 


g i y 






con 










Leu 


lie 


Val 


Tyr 


Gly 


Glu 


Thr 




635 








640 


lie 


Thr 


Ser 


Ser 


Ser 


Asn 


Glu 


650 










655 




Ser 


Lys 


Leu 


Thr 


lie 


Thr 


Ser 








670 






Asp 


Asp 


Ser 


Lys 


Glu 


He 


Gin 








685 








Val 


Ala 


Lvs 


lie 


Asn 


Gin 


Lys 






700 










Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 
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1 
1 








c, 
0 










10 










15 




Ala 


Th r- 

i n I 




net 

A 

^ u 


A 1 3 

M 1 a 


\J 1 u 


Php 


Phe 


Pro 


Glu 

w X. L* 


He 

IXC 


Pro 


L V<5 
lj y o 


lie 

x ^ w 

30 


Gin 


Phe 


Glu 


b i y 


by s 


b i u 


Ser 






Pro 


! .p it 


Ala 

~. x a 


Phe 


Ar o 

n x y 


Phe 

rut 


Tvr 

i y x 


ASD 


Pro 




^ c. 

J 3 




















4 5 








Sen 

r. b n 




V c3 .1 


lie 


ten 


Gly 




Pro 


Le u 


Lvs 


Asd 

*1 w 


His 


Le u 


LVS 


Phe 


Ser 




c n 

jU 








h^ 










60 










v a I 


Mia 


t ne 


Trp 


ni j 


Thr 


Php 


Va 1 


".Oil 


Glu 


Gl v 

ox y 


Ara 


Asd 


Pro 


Phe 


Giv 


00 








i n 

* 










75 










80 


MSp 


Pro 


i n r 


Ala 


n ) 1 1 

uiu 


li r n 




T r o 
1 r P 


A ^ n 


Am 


Phe 

CMC 


Ser 


Asd 


Pro 


Met 


Asp 








0 D 










7 V 










95 




Lvs 


Ala 


Phe 


Ala 


Arg 


Val 


Asp 


Ala 


Leu 


Phe 


Glu 


Phe 


Cys 


Glu 


Lys 


Leu 






i A A 








1 U D 










1 1 0 






Asn 


7 1 r. 

1 1 e 


b L U 


Tyr 


r n e 


b V 5 

m 


rne 


nib 


Acn 
MSp 


A r o 


ficn 

no p 


IXC 


Ala 

r\ x a 


Pro 


Glu 


Giv 

\jx y 






lib 






1 OA 
i t.\J 










X C V 








Lys 


Thr 


Leu 


Arg 


r l 
b 1 U 


i n r 


& c n 




He 


Leu 


Acn 


by D 


Va 1 

vox 


Val 

vox. 


Glu 


Ara 




1 "3 A 








i jj 




















; ie 


Lys 


r- l 1 1 
b 1 U 


M r g 


Met 


lj y b 


Ben 
Mop 


<^p r 


Asn 


Val 


T ,u <; 

lj y O 


Lpu 


Leu 

uc 


Tro 


Gl V 


Thr 


"i / c. 
I H D 






1 J L 










1 S5 










160 

4» V >r 


A 1 -, 

Mi a 




Leu 


r ne 


Oar 
I DJ 


n 1 5 


r ro 




Tyr 


Met 
170 


H i <? 

n x o 


Gl V 
y i y 


Ala 


Ala 


Thr 

x 1 1 4- 

175 


Thr 


Cys 


C ~-i »- 

ber 


Ala 


MSp 


Vol 


trie 


M » d 


T\/ r 

i yr 


Ala 


Ala 


Ala 

n x o 


Gin 


Val 


Lvs 


Lvs 


Ala 






1 E- A 
I C U 










185 










1 90 






Le u 


b I U 


lie 


. n r 


Lys 


1 J 1 'J 


I — t > 
JjCU 


y i y 


Gly Glu 


Gl v 

\j x y 


Tvr 


Val 


Phe 

kit w 


TrD 


Gly 






inc. 










?nn 










205 

C V J 








r* 1 . . 
b 1 y 


Arg 


b 1 U 


b i y 


Tvr 


1 U 


Thr 

Mil 


T o 1 1 


Leu 


Asn 


Thr 


Asd 


Leu 

w W 


Giv 

w x y 


Leu 

W V* 


Glu 




'< 1 A 


















270 










Leu 


GIu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 


Val 


Glu 


Tyr 


Ala 


Lys 


"5 ~> C 




















235 










240 


LV S 


lie 


b i y 


Tilt? 


Thr 


ui y 


Gin 

VJ 1 1 i 


Phe 


Leu 


lie 


Glu 


Pro 


Lvs 


Pro 


Lys 


Glu 






? d s 

fc H J 










250 










255 




ii »■* 


i n i. 


L»y o 


r. o 


n 1 n 


Tyr 


A D 


Phe 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 






> fin 

^ 0 v 










265 










270 






Le u 


Lys 


n5n 


"li 


o i y 


i p 1 1 




Glu 


Tyr 


Phe 


Lvs 


Phe 


Asn 


lie 


Glu 


Ala 




■J7C. 








P80 

t u u 










285 








Asn 


His 
l y u 


A 1 

. . x a 


Thr 


1-r C U 


Ala 


Gl v 


His 


Thr 


Phe 


Gin 


His 
300 


Glu 


Leu 


Arg 


Met 


r..I a 


*\ v- 


T 1 a 

lie 


T at » 
bcU 


u 1 y 


Lys 


T p u 


ui y 


Ser 


lie 


Asd 


Ala 


Asn 


Gin 


Gly 


Asp 


















315 










320 




Leu 


Leu 


Vj X > 


i rp 




Th r 


Aso 


Gin 


Phe 


Pro 


Thr 


Asn 


lie 


Tyr 


Asp 










1 P ^ 








330 










335 




J n r 


! n r 




A 1 ^ 
ni d 




Tyr 


n i u 


Va 1 


lie 


Lys 


Ala 


Gly Gly 


Phe 


Thr 


Lvs 


















345 










350 






bl y 


b i y 


Leu 


A c n 

M5 n 


Phe 


ficn 
Mo JJ 


M X © 


1 uc 


Val 


Arg 


Ar a 


Ala 


Ser 


Tvr 


Lvs 


Val 












ifin 

JOv 










365 








Glu 


A5D 


Leu 


Phe 


T 1 p 

lie 


« j i y 


nJ.3 


T 1 p 

IXC 


Ala 


Gly 


Met 


Asp 


Thr 


Phe 


Ala 


Leu 




4 

— i— 

3 m.j 








7 "7 ^ 










380 










Gly 


F he 


Lys 


lie 


A 1 , 

Mia 


Tyr 


Lys 


T oil 


Ala 


Lys 


A ^ n 

now 


Gly 


Val 


Phe 


ASD 


Lys 




















395 










400 


Php 






Gl u 


L v s 


Tvr 


Ar a 


Se r 


Phe 


Lys 


Glu 


Gly 


lie 


Gly 


Lys 


Glu 


















410 










415 




lie 


Val 


Glu 


Giv 


Lys 


Thr 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


lie 


He 








4 20 








425 










4 30 






Asp 


Lys 


Glu 


Asp 


lie 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


435 








440 










445 








Ser 


Leu 


Leu 


Asn 


Ser 


Tyr 


lie 


Val 


Lys 


Thr 


lie 


Ala 


Glu 


Leu 


Arg 


Ser 




450 








4 55 










4 60 










Glu 


Lys 


Asd 


Glu 


Leu 
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465 



<21C> 29 

<21!> 469 

<212> PRT 

<2\2> Artificial Sequence 
<210> 

< 2 2 3 > synthetic 



<A00> 29 



Met 


A r q 


Vol 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


1 






5 










10 


Ala 


Thr 


Ser 


Met 


Ala 


Glu 


Phe 


Phe 


Pro 


Glu 








20 










25 




Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


Asn 


Pro 


Leu 


Ala 




35 










40 






Glu 


Glu 


lie 


He 


Asp 


Gly 


Lvs 

mj 


Pro 


Leu 


Lys 




50 










55 








Val 


Ala 


Phe 


Trp 


His 


Thr 


Phe 


Val 


Asn 


Glu 


65 










70 










Asp 


Pre 


Thr 


Ala 


Asp 


Arg 


Pro 


Trp 


Asn 


Arg 








85 










90 


Lys 


Ala 


Phe 


Ala 


Arg 


Val 


Asp 


Ala 


Leu 


Phe 






100 










105 




Asn 


lie 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 


Asp 


Arg 






115 








120 






Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 


lie 


Leu 




130 








135 








I ie 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


Asn 


Val 


145 








150 










Ala 


Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 










165 










170 


Cys 


Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala' 


Ala 






ISO 










135 




Leu 


Glu 


He 


Thr 


Lys 


Glu 


Leu 


Gly 


Gly 


Glu 






195 










200 






G 1 y 


Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


210 










215 








Leu 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


225 










230 








lie 


Arg 


lie 


Gly 


Phe 


Th r 


Gly Gin 


Phe 


Leu 






245 










250 


P ro 


Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 






260 










265 




Leu 


Lys 


Ser 


His 


Glv 


Leu 


Asp 


Glu 


Tyr 


Phe 






275 








230 






Asn 


His 


Ala 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 




290 










295 








Ala 


Arg 


lie 


Leu 


Gly 


Lys 


Leu 


Glv 


Ser 


He 


305 








310 








Phe 


Leu 


Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 








325 










330 


Thr 


Thr 


Leu 


Ala 


Met 


Tyr 


Glu 


Val 


lie 


Lys 








340 










345 




Gly 


Giy 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arg 


355 










360 
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Leu 


Ala 


Leu 


Ala 


Ala 

15 


Ser 


He 


Pro 


Lys 


Val 
30 


Gin 


Phe 


Phe 


Lys 


Phe 
45 


Tyr 


Asp 


Pro 


Asp 


His 
60 


Leu 


Lys 


Phe 


Ser 


Gly Arg 


Asp 


Pro 


Phe 


Gly 


75 










80 


Tyr 


Thr 


Asp 


Pro 


Met 


Asp 








95 




Glu 


Phe 


Cys 


Glu 


Lys 


* 

Leu 






1 10 






Asp 


He 


Ala 


Pro 


Glu 


Gly 




125 








Asp 


Lys 
140 


mm *m 

Val 


Val 


Glu 


Arg 


Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


155 










160 


His 


Gly 


Ala 


Ala 


Thr 


Thr 








17 5 




Ala 


Gin 


Val 


Lys 

i on 


• 

Lys 


Ala 


Gly 


Tyr 


Val 
205 


Phe 


Trp 


Gly 


Thr 


Asp 
220 


Leu 


Gly 


Phe 


Glu 


Ala 


Val 


Asp 


Tyr 


Ala 


Lys 


235 










240 


Glu 


Pro 


Lys 


Pro 


Lys 
255 


Glu 


Ala 


Thr 


Ala 


Tyr 
270 


Ala 


Phe 


Lys 


Phe 


Asn 


lie 


Glu 


Ala 




285 








Gin 


His 
300 


Glu 


Leu 


Arg 


Met 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


315 










320 


Pro 


Thr 


Asn 


Val 


Tyr 
335 


Asp 


Ala 


Gly Gly 


Phe 


Thr 


Lys 








350 






Arg 


Ala 


Ser 


Tyr 


Lys 


Val 



365 
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Glu Asp Leu Phe 
370 

Gly Phe Lys Val 
333 

Phe lie Giu Glu 

Me Val Giu Gly 

4 20 

Aso Lys Glu Thr 
4 35 

Ser Leu I le Asn 
450 

Glu Lys Asp Glu 
465 



<210> 30 
< 2 1 1 > 463 
<2\2> PRT 
<2)3> Arcj ficia 

<220> 

<223> synthetic 

<40O> 30 
Met Arg Val Leu 
1 

Ala Thr Ser Met 

20 

Glu Gly Lvs Glu 
35 

Glu Glu Tie lie 
50 

Val Ala Phe Trp 

65 

Asp Pro Thr Ala 

Lys Ala Phe Ala 

100 

Asn lie Glu Tyr 
115 

Lys Thr Leu Arg 
130 

lie Lys Glu Arg 

145 

Ala Asn Leu Phe 

Cvs Ser Ala Asp 

ISO 

Leu Giu fie Thr 
195 

Giv Arg Giu Gly 
210 

Leu Glu Asn Leu 
225 

Arg lie Gly Phe 

Pro Thr Lys His 



T 1 

I le 


G 1 y 


HIS 


lie 






J / b 




Ala 


Tyr 


Lys 


Leu 




390 






Lys 


Tyr 


Arg 


Ser 


405 








Lys 


Val 


Aso 

* 


Phe 


lie 


Glu 


Leu 


Pro 








440 


Ser 


Tyr 


He 


Val 






455 





Leu 



Sequence 



Leu 


Val 


Ala 


Leu 


5 








Ala 


Glu 


Phe 


Phe 


Ser 


Thr 


Asn 


Pro 








40 


Asp 


Gly 


Lys 


Pro 






55 




His 


Thr 


Phe 


Val 




70 






Asp 


Arg 


Pro 


Trp 


85 








Arg 


Val 


Asp 


Ala 


Phe 


Cys 


Phe 


His 








120 


Glu 


Thr 


Asn 


Lys 






135 




Met 


Lys 


Asp 


Ser 




150 






Ser 


His 


Pro 


Arg 


165 








Val 


Phe 


Ala 


Tyr 


Lvs 


Giu 


Leu 


Gly 






200 


Tyr 


Giu 


Thr 


Leu 




215 




Ala 


Arg 


Phe 


Leu 




2 30 






Thr 


Gly 


Gin 


Phe 


245 








Gin 


Tyr 


Asp 


Phe 



Ala 


Gly 


Met 


Asp 








i o r\ 


Val 


Lys 


Asp 


Gly 






395 




Phe 


Arg 


Glu 


Gly 




410 






Glu 


Lys 


Leu 


Glu 


425 








Ser 


Gly 


Lys 


Gin 


Lys 


Thr 


lie 


Leu 








4 60 



Ala 


Leu 


Leu 


Ala 




10 






Pro 


Glu 


lie 


Pro 


25 








Leu 


Ala 


Phe 


Lys 


Leu 


Lys 


Asp 


His 








60 


Asn 


Glu 


Gly 


Arg 






75 




Asn 


Arg 


Tyr 


Thr 




90 






Leu 


Phe 


Glu 


Phe 


105 








Asp 


Arg 


Asp 


lie 


lie 


Leu 


Asp 


Lys 








140 


Asn 


Val 


Lys 


Leu 






155 




Tyr 


Met 


His 


Gly 




170 






Ala 


Ala 


Ala 


Gin 


185 








Glv 


Glu 


Gly 


Tyr 


Leu 


Asn 


Thr 


Asp 








220 


Arg 


Met 


Ala 


Val 






235 




Leu 


lie 


Glu 


Pro 




250 






Asp 


Val 


Ala 


Thr 



inr 


Php 


Al a 


1 Oil 


Val 


Leu 


Asp 


Lys 








400 


lie 


Gly 


Arg 


Asp 






415 




Glu 


Tyr 


lie 


lie 




4 30 






Glu 


Tyr 


Leu 


Glu 


445 








Glu 


Leu 


Arg 


Ser 



Leu 


Ala 


Ala 


Ser 






15 




Lys 


Val 


Gin 


Phe 




30 






Phe 


Tyr 


Asp 


Pro 


45 








Leu 


Lys 


Phe 


Ser 


Asp 


Pro 


Phe 


Gly 








80 


Asp 


Pro 


Met 


Asp 






95 




Cys 


Glu 


Lys 


Leu 




110 






Ala 


Pro 


Glu 


Gly 


125 








Val 


Val 


Glu 


Arg 


Leu 


Trp 


Gly 


Thr 








160 


Ala 


Ala 


Thr 


Thr 






175 




Val 


Lys 


Lys 


Ala 




190 






Val 


Phe 


Trp 


Gly 


205 








Leu 


Gly 


Phe 


Glu 


Asp 


Tyr 


Ala 


Lys 








240 


Lys 


Pro 


Lys 


Glu 






255 




Ala 


Tyr 


Ala 


Phe 
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260 




















9 1 n 






t 

Leu 


Lys 


Ser 


His 


Gly 


Leu 


Asp 


GIU 


1 yr 


r n e 


Lys 


rne 


Msn 


lie 


n. 1 ii 
uiu 


Zi 1 A 

Mia 




275 


















c O 0 








Asn 


His 

290 


Ala 


i n r 


Leu 


Aid 


r** 1 w 
uiy 




Th r 


Php 


y ill 


H i c. 




1 p n 


Oik r a 


Me t 


Ala 


Arg 


lie 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


lie 


Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


"AC 






J 1 u 




















320 


Leu 


Leu 


Leu 


Gly 


T r d 


AS D 


Thr 


HSp 


bin 


rfic 


r C O 


i it l 


r, c- n 
Moil 


Va \ 


Tvr 
i yr 


















J Jw 










j ~j j 




Thr 


Thr 


Leu 


Ala 


Me t 


Tvr 


Glu 


v a J. 


T 1 o 


I lie 

uy 5 


Mia 


o i y 


vj i y 


Pho 
n i c 


Th r 










j4 0 


















i ^n 






uly 


Gly 


Leu 


Asn 


Phe 


Asp 


Aid 


Lys 


\ ; -j "i 

va i 


m r g 


nr g 


ZI 1 a 
Mia 


Cor 


Tvr 

l yr 


Li y o 


Va 1 
v a a 


355 










"3 C A 

360 










JD J 








Glu 


Asp 
370 


Leu 


Phe 


lie 


01 y 


HIS 

T ^ t 


T 1 e. 

lie 


ZI 1 a 

Mia 


uiy 


Mo f 
He L 


Zi c o 
n 

J O v 


Th r 


Php 




1 P 11 


G I y 


Phe 


Lys 


val 


Ala 


Tvr 


Lys 


Leu 


Val 


Lys 


Asp 


Gly 


val 


Leu 


Asp 


Lys 


j- 5 








"3 Ci f\ 

J ^ u 








J? j 










4 00 

*1 V V 


Phe 


lie 


Glu 


Giu 


Lvs 


Tvr 


Arg 


Ser 


Phe 


Arg 


Glu 


Gly 


lie 


Gly 


Arg 


Asp 










105 






410 










415 




lie 


val 


Giu 


Gly 


Lys 


Val 


Asp 


Phe 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


lie 


lie 








•120 








425 










430 






Asp 


Lys 


Glu 


Thr 


lie 


Glu 


Leu 


Pro 


Ser 


Gly 


Lys 


Gin 


Glu 


Tyr 


Leu 


Glu 


135 










440 










445 








Ser 


Leu 
450 


lie 


Asn 


Ser 


Tyr 


lie 
455 


Val 


Lys 


Thr 


lie 


Leu 
4 60 


Glu 


Leu 


Arg 





< 2 1 0 > 31 
<21i> 25 
<2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 

< 2 2 0 > 

< 2 2 3 > synthetic 
<400> 31 

M«r Giv Lys Asn Glv Asn Leu Cys Cys Phe Ser Leu Leu Leu Leu Leu 
T b 10 15 

Leu Ala Gly Leu Ala Ser Gly His Gin 

20 25 



<210> 32 
<21 1 > 30 
< 2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

< ? 2 3 > synthetic 
<400> 32 

Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu Leu Val Ser 

15 10 15 

Thr Leu Leu Leu Phe Leu Val lie Ser His Ser Cys Arg Ala 

20 25 30 



<210> 33 
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<21:> 4 60 
<212> PRT 
<21 3> Artificial 

<220> 

<223> synchetic 

<400> 33 
Met Arg Val Leu 
1 

Ala Thr Ser Ala 

20 

Gin Ala Phe Tyr 
35 

1 le Arg Gin Lys 

50 

Trp He Pro Pro 
65 

Tyr Asp Pro Tyr 

Thr Val Glu Thr 

100 

Asn Thr Ala His 
115 

Asn His Arg Ala 

1 30 

Tyr Thr Trp Thr 
14 5 

Asn Tyr Leu Asp 

Thr Phe Gly Gly 

180 

Tyr Trp Leu Trp 
195 

lie Gly He Asp 
210 

Trp Val Val Lys 
225 

Giu Tyr Trp Asp 

Ser Gly Ala Lys 

2 60 

Ala Phe Asp Asn 
275 

Gly Gly Thr Val 
290 

Aia Asn His Asp 
305 

Phe Tie Leu Thr 

Giu Glu Trp Leu 

340 

Asp Asn Leu Ala 
355 

Glu Mec lie Phe 
370 

Thr Tyr lie Asn 
335 



Sequence 



Leu 


Val 


Ala 


Leu 


5 








Lys 


Tyr 


Leu 


Glu 


Trp 


Asp 


Val 


Pro 








4 0 


lie 


Pro 


Glu 


Trp 






55 




Ala 


Ser 


Lys 


Gly 




70 






Aso 


Tyr 


Phe 


Asp 


85* 








Arg 


Phe 


Gly 


Ser 


Ala 


Tyr 


Gly 


lie 








120 


Gly 


Gly 


Asp 


Leu 






135 




Asp 


Phe 


Ser 


Lys 




150 






Phe 


His 


Pro 


Asn 


165 








Tyr 


Pro 


Asp 


He 


Ala 


Ser 


Gin 


Glu 








200 


Ala 


Trp 


Arg 


Phe 






215 




Asp 


Trp 


Leu 


Asn 




230 






Thr 


Asn 


Val 


Asp 


24 5 








Val 


Phe 


Asp 


Phe 


Lys 


Asn 


lie 


Fro 








280 


Va 1 


Ser 


Arg 


Asp 






295 




Thr 


Asp 


lie 


lie 




310 






Tyr 


Glu 


Gly 


Gin 


325 








Asn 


Lys 


Asp 


Lys 


Gly 


Gly 


Ser 


Thr 








360 


Val 


Arg 


Asn 


Gly 






375 




Leu 


Gly 


Se r 


Ser 




390 







Ala Leu Leu Ala 
10 

Leu Glu Glu Gly 
25 

Ser Gly Gly lie 

Tyr Asp Ala Gly 

60 

Met Ser Gly Gly 
75 

Leu Gly Glu Tyr 
90 

Lys Gin Glu Leu 
105 

Lys Val He Ala 

Glu Trp Asn Pro 

140 

Val Ala Ser Gly 
155 

Glu Leu His Ala 
170 

Cys His Asp Lys 
185 

Ser Tyr Ala Ala 

Asp Tyr Val Lys 

220 

Trp Trp Gly Gly 
235 

Ala Leu Leu Asn 
250 

Pro Leu Tyr Tyr 
265 

Ala Leu Val Glu 

Pro Phe Lys Ala 

300 

Trp Asn Lys Tyr 
315 

Pro Thr lie Phe 
330 

Leu Lys Asn Leu 
345 

Ser lie Val Tyr 

Tyr Gly Ser Lys 

380 

Lys Val Gly Arg 
395 



Leu 


Ala 


Ala 


Ser 






15 




Gly 


Val 


He 


Met 




30 






Trp 


Trp 


Asp 


Thr 


45 








He 


Ser 


Ala 


He 


Tyr 


Ser 


Met 


Gly 








80 


Tyr 


Gin 


Lys 


Gly 






95 




He 


Asn 


Met 


lie 




110 






Asp 


He 


Val 


He 


125 








Phe 


Val 


Gly 


Asp 


Lys 


Tyr 


Thr 


Ala 








160 


Gly 


Asp 


Ser 


Gly 






175 




Ser 


Trp 


Asp 


Gin 




190 






Tyr 


Leu 


Arg 


Ser 


205 








Gly 


Tyr 


Gly 


Ala 


Trp 


Ala 


Val 


Gly 








240 


Trp 


Ala 


Tyr 


Ser 






255 




Lys 


Met 


Asp 


Ala 




270 






Ala 


Leu 


Lys 


Asn 


285 








Val 


Thr 


Phe 


Val 


Pro 


Ala 


Tyr 


Ala 








320 


Tyr 


Arg 


Asp 


Tyr 






335 




He 


Trp 


He 


His 




350 






Tyr 


Asp 


Ser 


Asp 


365 








Pro 


Gly 


Leu 


He 


Trp 


Val 


Tyr 


Val 






400 
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Fro Lys Phe Ala Gly Ala Cys He His Glu Tyr Thr Gly Asn Leu Gly 

405 410 415 

Gly Trp Val Asp Lys Tyr Val Tyr Ser Ser Gly Trp Val Tyr Leu Glu 

420 425 430 

Ala Pro Ala Tyr Asp Pro Ala Asn Gly Gin Tyr Gly Tyr Ser Val Trp 

435 440 445 

Ser Tyr Cys Gly Val Gly Ser Glu Lys Asp Glu Leu 
4S0 455 460 

<2\0> 34 
<2!3> 825 
<2\2> PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3 > synthetic 
<400> 34 

Met Ara Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

1*5 10 15 

Ala Thr Ser Ala Gly His Trp Tyr Lys His Gin Arg Ala Tyr Gin Phe 

20 25 30 

Thr Gly Glu Asp Asp Phe Gly Lys Val Ala Val Val Lys Leu Pro Met 

35 40 45 

Asp Leu Thr Lys Val Gly lie lie Val Arg Leu Asn Glu Trp Gin Ala 

50 55 60 

Lvs Asp Val Ala Lys Asp Arg Phe He Glu lie Lys Asp Gly Lys Ala 
63 70 75 80 

Glu Val Trp He Leu Gin Gly Val Glu Glu lie Phe Tyr Glu Lys Pro 

3 5 90 95 

Asp Thr Ser Pro Arg Tie Phe Phe Ala Gin Ala Arg Ser Asn Lys Val 

100 105 110 

lie Glu Ala Phe Leu Thr Asn Pro Val Asp Thr Lys Lys Lys Glu Leu 

115 120 125 

Phe Lys Val Thr Val Asp Glv Lys Glu lie Pro Val Ser Arg Val Glu 

130 135 140 

Lys Ala Asp Pro Thr Asp lie Asp Val Thr Asn Tyr Val Arg lie Val 
145 150 155 160 

Leu Ser Glu Ser Leu Lys Glu Glu Asp Leu Arg Lys Asp Val Glu Leu 

165 170 175 

lie lie Glu Gly Tyr Lys Pro Ala Arg Val lie Met Met Glu lie Leu 

180 185 190 

Asp Asp Tyr Tyr Tyr Asp Glv Glu Leu Gly Ala Val Tyr Ser Pro Glu 

195 200 205 

Lvs Thr lie Phe Arg Val Trp Ser Pro Val Ser Lys Trp Val Lys Val 

210 215 220 

Leu Leu Phe Lvs Asn Glv Glu Asp Thr Glu Pro Tyr Gin Val Val Asn 
225 230 235 240 

Met Glu Tyr Lys Gly Asn Gly Val Trp Glu Ala Val Val Glu Gly Asp 

245 250 255 

Leu Asp Gly Val Phe Tyr Leu Tyr Gin Leu Glu Asn Tyr Gly Lys He 

260 265 270 

Arg Thr Thr Val Aso Pro Tyr Ser Lys Ala Val Tyr Ala Asn Asn Gin 

275 290 285 

Glu Ser Ala Val Val Asn Leu Ala Arg Thr Asn Pro Glu Gly Trp Glu 

290 295 300 

Asn Asp Arg Gly Pro Lys He Glu Gly Tyr Glu Asp Ala He He Tyr 
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305 



CI 1 1 1 


lie 


n I 5 


T 1 o 
lie 


Asn 


Lys 


Gly 


Leu 








340 


Gly 


Gly 


Val 


Thr 






355 




His 


Val 


His 


lie 




370 






Asp 


Lys 


Asp 


Phe 


385 








?he 


Me L 


val 


D /*\ 

r 1 O 


Th r 


Arg 


I ] » 


Arg 










Gly 


] ■ e 


Giy 


Val 




•1 3 5 




Gly 


Glu 


Leu 


Ser 




•150 






lie 


Asp 


Lys 


Thr 










i 1 6 s 


1. d 






Thr 


Tyr 


Tro 

* 


Val 






500 


Met 


Gly 


Leu 


lie 






5 i 5 




His 


Lys 


He 


Asp 




530 






Trp 


Gi y 


Ala 


Pro 


545 








vol 






Phe 


Phe 


Asn 


Pro 


Ser 








560 


Thr 


Lys 


lie 


Lvs 






595 




Leu 


Tie 


Lys 


Ser 




610 






Ala 


Cys 


His 


Asp 


625 








L y s 


Mi 3 


h. 5 p 


Lr y b 


Gin 


Lys 


Leu 


Ala 






660 


Leu 


H is 


Glv 


Gly 






675 




Ser 


Tyr 


Asn 


Ala 




690 






Leu 


Gin 


Phe 


lie 


"?05 








Arg 


Lvs 

4 


Glu 


His 


Lys 


r.lS 


Leu 


Glu 






740 


Leu 


Lys 


Asp 


His 






755 




Tvr 

m 


Asn 


Gly 


Asn 





310 






Ala 


Asp 


lie 


Thr 


325 








Tyr 


Leu 


Gly 


Leu 


Thr 


Gly 


Leu 


Ser 








360 


Leu 


Pro 


Phe 


Phe 






375 




Glu 


Lys 


Tyr 


Tyr 




390 






Glu 


Gly 


Arg 


Tyr 


4 05 








Glu 


Val 


Lys 


Glu 


lie 


Met 


Asp 


Met 








440 


Ala 


Phe 


Asp 


Gl n 






455 




Glv 


Ala 


Tyr 


Leu 


470 






Arg 


Pro 


Met 


Met 


435 








Lys 


Glu 


Tvr 

• 


His 


Asp 


Lys 


Lys 


Thr 








520 


Pro 


Thr 


lie 


lie 






535 




lie 


Arg 


Phe 


Giy 




550 






Asn 


Asp 


Glu 


Phe 


565 








Val 


Lys 


Gly 


Phe 


Arg 


Gly 


Val 


Val 








600 


Phe 


Ala 


Leu 


Asp 






615 




Asn 


His 


Thr 


Leu 




630 






Lys 


Lys 


Glu 


Trp 


645 








Gly 


Ala 


I le 


Leu 


Gin 


Asp 


Phe 


Cys 








680 


Pro 


lie 


Ser 


lie 






695 




Asd 


Val 


Phe 


Asn 


710 






Pro 


Ala 


Phe 


Arg 


725 








Phe 


Leu 


Pro 


Gly 


Ala 


Gly 


Gly 


Asp 








760 


Leu 


Glu 


Lys 


Thr 







315 




Gly 


Leu 


Glu 


Asn 




330 






Thr 


Glu 


Glu 


Asn 


345 








His 


Leu 


Val 


Glu 


Asp 


Phe 


Tyr 


Thr 








380 


Asn 


Trp 


Gly 


Tyr 






395 




Ser 


Thr 


Asp 


Pro 




410 






Met 


Val 


Lys 


Ala 


425 








Val 


Phe 


Pro 


His 


Thr 


val 


Pro 


Tyr 








4 60 


Asn 


Glu 


Ser 


Gly 






475 




Arg 


Lys 


Phe 


lie 




490 






lie 


Asp 


Gly 


Phe 


505 








Met 


Leu 


Glu 


Val 


Leu 


Tyr 


Gly 


Glu 








540 


Lys 


Ser 


Asp 


Val 






555 




Arg 


Asp 


Ala 


lie 




570 






Val 


Met 


Gly 


Giy 


585 








Gly 


Ser 


lie 


Asn 


Pro 


Glu 


Glu 


Thr 








620 


Trp 


Asp 


Lys 


Asn 






635 




Thr 


Glu 


Glu 


Glu 




650 






Leu 


Thr 


Ser 


Gin 


665 








Arg 


Thr 


Thr 


Asn 


Asn 


Gly 


Phe 


Asp 








700 


Tyr 


His 


Lys 


Gly 






715 




Leu 


Lys 


Asn 


Ala 




730 






Gly 


Arg 


Arg 


He 


745 








Pro 


Trp 


Lys 


Asp 


Thr 


Tyr 


Lys 


Leu 
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Ser 


Gly 


Val 


Lys 






335 




Thr 


Lys 


Ala 


Pro 




350 






Leu 


Gly 


Val 


Thr 


365 








Glv 


ASD 


Glu 


Leu 


Asp 


Pro 


Tyr 


Leu 








400 


Lys 


Asn 


Pro 


His 




415 




Leu 


His 


Lys 


His 




4 30 






Thr 


Tyr 


Gly 


He 


445 








Tvr 


Phe 


Tyr 


Arg 


Cys 


Gly 


Asn 


Val 








480 


Val 


Asp 


Thr 


Va 1 






495 




Arg 


Phe 


Asp 


Gin 




510 






Glu 


Arg 


Ala 


• 

Leu 


525 








Pro 


Trp 


Gly 


Gly 


Ala 


Gly 


Thr 


His 








560 


Arg 


Gly 


Ser 


Val 






575 




Tyr 


Gly 


Lys 


Glu 




590 






Tyr 


Asp 


Gly 


Lys 


605 








He 


Asn 


Tyr 


Ala 


Tyr 


Leu 


Ala 


Ala 






640 


Leu 


Lys 


Asn 


Ala 






655 




Gly 


Val 


Pro 


Phe 




670 






Phe 


Asn 


Asp 


Asn 


685 








Tyr 


Glu 


Arg 


Lys 


Leu 


He 


Lys 


Leu 








720 


Glu 


Glu 


lie 


Lys 






735 




Val 


Ala 


Phe 


Met 




750 






lie 


Val 


Val 


He 


765 








Pro 


Glu 


Gly 


Lys 



WO 21105/096804 



PCT/US2004/007182 



770 775 780 

Trp Asn Val Val Val Asn Ser Gin Lys Ala Gly Thr Glu Val He Glu 
735 790 795 800 

Thr Val Glu Gly Thr lie Glu Leu Asp Pro Leu Ser Ala Tyr Val Leu 

805 810 815 

Tvr Arg Glu Ser Glu Lys Asp Glu Leu 

820 825 



<210> 35 

< 2 1 1 > 460 

<212> PRT 

< 2 1 3 > Artificial Sequence 
<220> 

< 2 2 3 > synthetic 



<400> 35 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 


i 

J. 






mJ 










10 










15 




A la 


Th r 

i I i l 


Ser 


Ala 


Lvs 


Tvr 

i y l 


Le u 


Glu 


Leu 


Glu 


Glu 


Gl v 


Gl v 


Val 


lie 


Met 








20 








25 










30 






Gin 


Ala 


Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


lie 


Trp 


Trp Asp 


Thr 






35 










40 










4 5 








! ie 


Arg 


Gin 


Lys 


lie 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 
60 


lie 


Se r 


Ala 


lie 


Tr d 


lie 


P r O 


P r O 


Ala 


Se r 


Lvs 


Gl v 


Me t 


Ser 


Gly 


Gl v 


Tvr 


Ser 


Met 


Gly 












70 










75 










80 


T W f 
* J ' 


As o 


Pro 




Aso 


Tvr 


Phe 


As D 


Leu 


Gly 
90 


Glu 


Tvr 


Tvr 

i y i 


Gin 


Lys 
95 


Gly 


Tp r 


1 1 - 1 


G « u 


Thr 
100 


Arg 


Phe 


Gly 


Ser 


Lys 
105 


Gin 


Glu 


Leu 


lie 


Asn 
110 


Met 


lie 


Asn 


Thr 


Ala 
115 


His 


Ala 


Tyr 


Gly 


lie 
120 


Lys 


Val 


lie 


Ala 


Asp 
125 


lie 


Val 


lie 


Asn 


His 


Arg 


Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp Asn 


Pro 


Phe 


Val 


Gly Asp 




1 30 










135 










140 










Tyr 


Thr 


Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


H5 










150 










155 










160 


Asn 


Tyr 


Leu 


Asp 


Phe 
165 


His 


Pro 


Asn 


Glu 


Leu 
170 


His 


Ala 


Gly 


Asp 


Ser 
175 


Gly 


Thr 


Phe 


Gly 


Gly 
180 


Tyr 


Pro 


Asp 


lie 


Cys 
135 


His 


Asp 


Lys 


Ser 


Trp 
190 


Asp 


Gin 


jyr 


Trp 


Leu 
195 


Trp 


Ala 


Ser 


Gin 


Glu 
200 


Ser 


Tyr 


Ala 


Ala 


Tyr 
205 


Leu 


Arg 


Ser 


He 


Gly 
210 


lie 


Asp 


Ala 


Trp 


Arg 
215 


Phe 


Asp 


Tyr 


Val 


Lys 
220 


Gly 


Tyr 


Gly 


Ala 


Tro 


Val 


Val 


Lvs 

* 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp Gly 


Gly 


Trp 


Ala 


Val 


Gly 


225 










230 










235 










240 


Glu 


Tyr 


Trp 


Asp 


Thr 
245 


Asn 


Val 


Asp 


Ala 


Leu 
250 


Leu 


Asn 


Trp 


Ala 


Tyr 
255 


Ser 


Ser 


Gly 


Ala 


Lys 
260 


Val 


Phe 


Asp 


Phe 


Pro 
265 


Leu 


Tyr 


Tyr 


Lys 


Met 
270 


Asp 


Ala 


A 1 a 


Phe 


Asp 


Asn 


Lvs 


Asn 


lie 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 






275 








230 










285 








Gly 


Gly 


Thr 


Val 


Val 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


290 










295 










300 










A 1 a 


Asn 


His 


Asp 


Thr 


Asp 


lie 


lie 


Trp 


Asn 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 



305 310 315 320 
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Phe 


lie 


Leu 


Thr 


Tvr 


Glu 


Gly 


Gin 










325 








Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 








340 










Asp 


Asn 


Leu 


Ala 


Gly 


Gly 


Se r 


Thr 






355 










360 


Giu 


Met 


lie 


Phe 


Vai 


Arg 


Asn 


Gly 




370 










37 5 




Thr 


Tyr 


lie 


Asn 


Leu 


Glv 


Ser 


Ser 


385 










390 






Pro 


Lys 


Phe 


Ala 


Giy 


Ala 


Cys 


I le 








405 








Gly 


Trp 


Vai 


Asp 


Lys 


Tyr 


Val 


Tyr 








420 










Ala 


Pro 


Ala 


Tyr 


Asp 


Pro 


Ala 


Asn 






435 










440 


Ser 


Tyr 


Cys 


Giy 


Val 


Gly 


Ser 


Glu 




4 50 










455 





< 2 1 0 > 36 

<2l:> 718 

< 2 I 2 > PRT 

<?13> Artificial Sequence 
<220> 

< 2 2 3 > synthetic 



<400> 36 



Met 


Arg 


Val 


Leu 


Leu 


Val 


Ala 


Leu 


1 






5 








Ala 


Thr 


Ser 


Met 
20 


Glu 


Thr 


lie 


Lys 


Lys 


Vai 


Vol 


lie 


Gly 


Glu 


Pro 


Phe 














40 


Gin 


Lys 
50 


lie 


Ser 


Ser 


Asn 


Lys 
55 


Ser 


va 1 


Gin 


Gin 


Gly 


Asn 


Lys 


Val 


lie 


65 










70 






Giu 


His 


: Le 


rie 


Giy 
95 


Leu 


Gly 


Glu 


Arg 


Lys 


Arg 


Tvr 


Vai 


Met 


Tyr 


Asn 






100 










Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Val 


Ser 




115 










120 


As d Gly 


Vai 


Ala 


Thr 


Gly 


Tyr 


Phe 




1 30 










135 




Phe 


Asp 


Vai 


Giy 


Leu 


Glu 


Glu 


Tvr 


145 










150 






GJu 


Asp 


Ser 


Val 
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<210> 37 
<211> 14 34 
<212> DWA 

< 2 1 3 > Thermctoga maritima 
<400> 37 

atgaaagaaa ccgctgctgc taaattcgaa 
accctqgtgc cacgcqgttc catggccgag 
gagggcaagg agtccaccaa cccgctcgcc 
gacggcaagc cgctcaagga ccacctcaag 
aacgagcgcc gcgacccgtt cggcgacccg 
gacccgacgg acaaggcccc cgcccgcgtg 
aacatcgagc acttctgctt ccacgaccgc 
gagaccaaca agatccccga caaggtggtg 
aacgcgaagc tcctctgggg caccgccaac 
gccgccacca cctgctccgc cgacgtgttc 
ctggagatca ccaaggagct gggcggcgag 
tacgagaccc tcctcaacac cgaccccggc 
cgcatggccg tggagtacgc caagaagatc 
aagccgaagg agccgaccaa gcaccagtac 
cccaagaacc acggcctcga cgagtacttc 
ctcgccggcc acaccttcca gcacgagctg 
cccatcgacg ccaaccaggg cgacctcctc 
aacatctacg acaccaccct cgccatgtac 
ggcggcctca acttcgacgc caaggtgcgc 
atcggccaca tcgccggcat ggacaccttc 
gccaaggacq gcgtgcccga caagttcacc 
accggcaagg agaccgtgga gggcaagacc 
gacaaggagg acaccgagct gccgcccggc 
ccctacatcg tgaagaccac cgccgagctg 

<210> 39 

<21 1> 477 

<212> FRT 

< 2 1 3 > Thermocoga maritima 
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cgccagcaca tggacagccc agatctgggt 60 
ttcttcccgg agatcccgaa gatccagttc 120 
ttccgcttct acgacccgaa cgaggtgatc 180 
ttctccgtgg ccttctggca caccttcgtg 240 
accgccgagc gcccgtggaa ccgcttctcc 300 
gacgccctct tcgagttctg cgagaagctc 360 
gacatcgccc cggagggcaa gaccctccgc 420 
gagcgcatca aggagcgcat gaaggactcc 480 
ctcttctccc acccgcgcta catgcacggc 540 
gcctacgccg ccgcccaggt gaagaaggcc 600 
ggctacgtgt tctggggcgg ccgcgagggc 660 
ctggagctgg agaacctcgc ccgcttcctc 720 
ggcttcaccg gccagttcct catcgagccg 780 
gacttcgacg tggccaccgc ctacgccttc 840 
aagttcaaca tcgaggccaa ccacgccacc 900 
cgcatggccc gcatcctcgg caagctcggc 960 
ctcggctggg acaccgacca gttcccgacc 1020 
gaggtgatca aggccggcgg cttcaccaag 1080 
cgcgcctcct acaaggtgga ggacctcttc 1140 
gccctcggct ccaagatcgc ctacaagctc 1200 
gaggagaagt accgctcctt caaggagggc 1260 
gacttcgaga agctggagga gtacatcatc 1320 
aagcaggagt acctggagtc cctcctcaac 1380 
cgctccgaga aggacgagct gtga 1434 
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<210> 39 
<211> 243^ 
<2i2> DMA 

< 2 1 3 > Thermotcga neapolitana 
<400> 39 

atgaaagaaa ccqctgctgc taaattcgaa 
accctggtgc cacgcggttc catggecgag 
gagggcaagg agtccaccaa cccgctcgcc 
gaeggcaage cqctcaagga ccacctcaag 
aacgagggee gcgacccgtt cggcgacccg 
gacccgatgg acaaggccct cgcccgcgtg 



cgccagcaca tggacagccc agatctgggt 60 

ttcttcccgg agatcccgaa ggtgcagttc 120 

ttcaagttct acgacccgga ggagatcatc 180 

ttctccgtgg ccttctggca caccttcgtg 240 

accgccgacc gcccgtggaa ccgctacacc 300 

gacgccctct tcgagttctg cgagaagctc 360 
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aacatcgagt acttctgctt ccacgaccgc cacatcgccc cggagggcaa gaccctccgc 420 

gagaccaaca agatcctcga eaaqgtggtg gagcgcatca aggagcgcat gaaggactcc 480 

aacotaaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 

qccoccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 

ctggagatca ccaaggagcc gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 

tacaagaccc ccctcaacac cgacctcggc ttcgagctgg agaacctcgc ccgcttcctc 720 

cgcatggccg tggactacgc caaqcgcatc ggcttcaccg gccagttcct catcgagccg 780 

aagccgaagg aaccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgcctcc 840 

ctcaagtccc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 

ctcgccgacc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 

tccatcgacg ccaaccagqg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 

aacgtgiacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 

ggcggcc:ca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 

atcggccaca ccgccggcat ggacaccttc gccctcggct tcaaggtggc ctacaagctc 1200 

gtaaagaacg cjcgtgctcga caagttcatc gaggagaagt accgctcctt ccgcgagggc 1260 

atcggccccg acaccgtgga gggcaaggtg gacttcgaga agctggagga gtacatcatc 1320 

gacaaggaga ccatcgagcc gccgtccggc aagcaggagt acccggagtc cctcatcaac 1380 

tcccacatcg tgaagaccat cctggagctg cgctccgaga aggacgagcc gtga 1434 

<210> 40 
<2H> 477 
< 2 1 2 > PRT 

<213> Thermotoga neapolitana 
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<210> 41 
< 2 1 1 > 1435 
<212> DNA 

< 2 1 3 > Thermotoga maritima 
<400> 41 

atggqcacca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 

atggctagca tgactggtgg acagcaaatg qgteggatec ccatggccga gttcttcccg 120 

gagatcccga agatccagtt cgagggcaag gagtccacca acccgctcgc cttccgcttc 180 

tacgacccga acgaggtgat egaeggcaag ccgctcaagg accacctcaa gttctccgtg 240 

gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgag 300 

cgcccgtgga accgcttctc cgacccgatg gaeaaggect tcgcccgcgt ggacgccctc 360 

ttcgagttct gegagaaget caacatcgag tacttctget tccacgaccg cgacatcccc 420 

eggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagegcatea 480 

aggaaegcat gaagaactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 

acccgcgcta catgcacggc qccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 

ccgcccaggt qaagaaggee ctggagatca ccaaggagct gggeggegag ggctacgtgt 660 

tetgggqegg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ctggagctgg 720 

agaacctccc ccgctccccc cgcatggccg tggagtaege caagaagatc ggcttcaccg 780 

gccagtccct catcoagccg aagecgaagg agccgaccaa gcaccagtac gettcgaegt 840 

ggccaccgcc tacgccttec tcaagaacca cggcctcgac gagtacttca agttcaacat 900 

cgaggccaac cacgccaccc tcgccggcca caccttccag cacgagctgc gcatggcccg 960 

catcctcgac aagcccggct ccatcgacgc caaccagggc gacctcctcc tcggctggga 1020 

caccqaccag ttcccgacca acatctacga caccaccctc gccatgtacg aggtgatcaa 1080 

ggccggcggc ctcaccaagq gcggccccaa cttcgacgcc aaggtgcgcc gcgcctccta 1140 

caagguggag gacctcttca tcggccacat cgccggcatg gacaccttcg ccctcggctt 1200 

caagatcgcc cacaagctcg ecaaggaegg cgtgttcgac aagttcatcg aggagaagta 1260 

ccgctccctc aaggagggca teggcaagga gatcgtggag ggcaagaccg acttcgagaa 1320 

geeggaggag tacaccatcg acaaggagga catcgagctg ccgtccggca agcaggagta 1380 
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cccggagtcc ctcctcaact cctacatc 

<210> 42 

<J?11> 4 73 

<2\2> PRT 

<213> Thermotoga maritima 

<4 00> 4 2 
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<210> 43 
<2H> 1436 
<212> DMA 

< 2 1 3 > Thermotoga neapolitana 
<400> 43 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 

atqgctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 

gagatcccga aggtgcagtt cgagggcaag gagtccacca acccgctcgc cttcaagttc 180 

tacaacccgg aggagatcat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 

gccttctggc acaccttcct gaacgagggc cgcgacccgc tcggcgaccc gaccgccgac 300 

cgcccgccga accgctacac cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 

ctcgagtcct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 

cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 430 

aggagcgcat gaaggactcc aacgtgaagc tcctccgggg caccgccaac ctcttctccc 540 

acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 

ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 

tctagggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ttcgagctgg 720 

agaacctcgc ccgcttcccc cgcatggccg tggactacgc caagcgcatc ggcttcaccg 780 

gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gacttcgacg 840 

tggccaccgc ccacgcctcc ctcaagtccc acggcctcga cgagtacttc aagttcaaca 900 

tcgaggccaa ccacgccacc cccgccggcc acaccttcca gcacgagctg cgcatggccc 960 

gcaccctcgg caagcccggc tccatcgacg ccaaccaggg cgacctcctc ctcggctggg 1020 

acaccgacca gttcccgacc aacgtgtacg acaccaccct cgccatgtac gaggtgatca 1080 

aggccggcqg cttcaccaag ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct 1140 

acaaggtgga gaacctcttc atcggccaca tcgccggcat ggacaccctc gccctcggct 1200 

ccaaggtqgc ctacaagctc gtgaaggacg gcgtgctcga caagttcatc gaggagaagt 1260 

accactcctt ccgcgagggc atcggccgcg acatcgtgga gggcaaggtg gacttcgaga 1320 

agctggagga gtacatcatc gacaaggaga ccatcgagct gccgtccggc aagcaggagt 1380 

acctggagtc cctcatcaac tcctacatcg tgaagaccat cctggagctg cgctga 1436 

<210> 44 
<211> 478 
<212> PRT 

< 2 1 3> Thermo*: oga neapolitana 
<400> 44 
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<?10> 46 
< 2 1 1 > 3285 
<212> ON A 

< 2 1 3 > Aspergillus shi rousami 
<400> 46 

QccaccccQQ ccgactggcg ctcccagccc atctacttcc tcctcaccga ccgcttcgcc 60 

cgcaccgacg gccccaccac cgccacctgc aacaccgccg accagaagta ctgcggcggc 120 

acctggcagg gcaccatcga caagctcgac tacatccagg gcatgggctt caccgccatc 180 

tgqatcaccc cggtgaccgc ccagctcccg cagaccaccg cctacggcga cgcctaccac 240 

ggctactggc agcaggacat ctactccctc aacgagaact acggcaccgc cgacgacctc 300 

aaggccccct cccccgccct ccacgagcgc ggcatgtacc tcatggtgga cgtggtggcc 360 

aaccacatgg gctacgacgg cgccggcccc tccgtggact actccgtgtt caagccgttc 420 

ccctcccagg actacttcca cccgttctgc ttcatccaga actacgagga ccagacccag 480 

gtggaggact gctggctcgg cgacaacacc gcgtccctcc cggacctcga caccaccaag 540 

gacgtggtga agaacgagtg gtacgactgg gtgggctccc tcgtgtccaa ctactccatc 600 

gacggccccc gcatcgacac cgtgaagcac gtgcagaagg acttctggcc gggctacaac 660 

aaggccgccg gcgtgcactg catcggcgag gtgctcgacg tggacccggc ctacacctgc 720 

ccgcaccaga acgtgatgga cggcgtgctc aaccacccga tctactaccc gctcctcaac 780 

gccttcaagt ccacctccgg ctcgatggac gacccctaca acatgatcaa caccgtgaag 840 

tccgaccgcc cggaccccac cctcctcggc accttcgtgg agaaccacga caacccgcgc 900 

cccgccccct acaccaacga catcgccccc gccaagaacg cggccgcctt catcatcctc 960 

aacgacggca tcccgatcat ctacgccggc caggagcagc actacgccgg cggcaacgac 1020 

ccggccaacc gcgaggccac ctggctctcc ggctacccga ccgactccga gctgtacaag 1080 

cccatcgcct ccgccaacgc catccgcaac tacgccatct ccaaggacac cggcttcgtg 1140 

acctacaaga actggccgat ctacaaggac gacaccacca tcgccatgcg caagggcacc 1200 

gacggccccc agatcgtgac catcctctcc aacaagggcg cctccggcga ctcctacacc 1260 

cccticcccct ccggcgccgg ctacaccgcc ggccagcagc tcaccgaggt gatcggctgc 1320 

accaccgcca ccgtgggccc cgacggcaac gtgccggtgc cgacggccgg cggcctcccg 1380 

cgcgtgctcc acccgaccga gaagcccgcc ggctccaaga tatgctcctc ctccaagccg 1440 

gccaccctcg actcccggcc ccccaacgag gccaccgtgg cccgcaccgc catcctcaac 1500 

aacaccggcg ccgacggcgc ctgggtgccc ggcgccgact ccggcatcgt ggtggcctcc 1560 

ccgcccaccg acaacccgga ctacttctac acccggaccc gcgactccgg catcgtgctc 1620 
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aagaccctcg tggacctctt ccgcaacggc gacaccgacc tcctctccac "catcgagcac 1680 

tacccctcct cccaggccat catccagggc gtgcccaacc cgtccggcga cctctcctcc 1740 

ggcggcctcg gcgagccgaa qttcaacgtg gacgagaccg cctacgccgg ctcctggggc 1800 

cgcccgcagc gcgacggccc ggccctccgc gccaccgcca tgatcggctt cggccagcgg 1860 

ctcctcgaca acggctacac ctccgccgcc accgagatcg tgtggccgct cgtgcgcaac 1920 

gacctctccc acgtggccca gtactggaac cagaccggct acgacctctg ggaggaggtg 1980 

aacggctcct ccttcttcac catcgccgtg cagcaccgcg ccctcgtgga gggccccgcc 2040 

tccgccaccg ccgtgggccc ctcccgctcc tggcgcgacc cccaggcccc gcagatcctc 2100 

tgccacctcc agtccttctg gaccggctcc tacatcctcg ccaacttcga ctcctcccgc 2160 

cccggcaagg acaccaacac cctcctcggc cccatccaca ccttcgaccc ggaggccggc 2220 

tgcgacgact ccaccttcca gccgtgctcc ccgcgcgccc tcgccaacca caaggaggtg 2280 

gcggacccct tccgctccat ctacaccctc aacgacggcc tctccgactc cgaggccgtg 2340 

gccgcgggcc gctacccgga ggactcctac tacaacggca acccgtggtt cctctgcacc 2400 

ctcgccgccg ccgagcagct ctacgacgcc ctctaccagt gggacaagca gggctccctg 2460 

gacatcaccg acgcgtccct cgactcctcc aaggccctct actccggcgc cgccaccggc 2520 

acccacccct ccccctcctc cacctactcc tccatcgtgt ccgccgtgaa gaccttcgcc 2580 

gacggctccg tgtccatcgt ggagacccac gccgcctcca acggctccct ctccgagcag 2640 

ttcgacaagt ccgacggcga cgagctgtcc gcccgcgacc ccacctggtc ctacgccgcc 2700 

cccctcaccg ccaacaaccg ccgcaactcc gtggcgccgc cgtcctgggg cgagacctcc 2760 

gcctcctccg cgccgggcac ctgcgccgcc acccccgcct ccggcaccta ctcctccgtg 2820 

accgcgaccc cctggccgtc catcgtggcc accggcggca ccaccaccac cgccaccacc 2880 

accggctccg gcggcgtgac ctccacctcc aagaccacca ccaccgcctc caagacctcc 2940 

accaccaccz cccccacctc ctgcaccacc ccgaccgccg tggccgtgac cttcgacctc 3000 

accgccacca ccacctacgg cgagaacatc tacctcgtgg gctccatctc ccagctcggc 3060 

gaciaggaga cctccgacgg catcgccctc Cccgccgaca agtacacctc ctccaacccg 3120 

ccgtggtacg cgaccqtgac ccccccggcc ggcgagtcct tcgagtacaa gctcatccgc 3180 

gtggagcccg acgaccccgt ggagtgggag tccgacccga accgcgagta caccgtgccg 3240 

caggcctgcg gcgagtccac cgccaccgtg accgacacct ggcgc 3285 

<210> 47 
<211> 679 
<212> PRT 

< 2 1 3 > Thermoanaerobacteri urn t hermosaccharolyt icum 
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645 650 655 

Ala Glu Tyr Val lie Leu Phe Ala Ser Asn lie Glu His Lys Val Leu 

660 665 670 

Asp Met Pro Asp lie Val Tyr 
675 



<210> -18 
< 2 1 1 > 2037 
<212> DNA 

< 2 1 3 > Thermoanaerobact er ium t hermosaccha rol yt icum 

<22C> 

< 2 2 3 > synthetic 

<400> 48 

gtgctctccg gctgctccaa caacgtgtcc tccatcaaga tcgaccgctt caacaacatc 60 

tccgccgtga acggcccggg cgaggaggac acctgggcct ccgcccagaa gcagggcgtg 120 

ggcaccgcca acaactacgt gtcccgcgtg tggttcaccc tcgccaacgg cgccatctcc 180 

gaggtgtact acccgaccat cgacaccgcc gacgtgaagg agatcaagtt catcgtgacc 240 

gacggcaagt cctccgcgtc cgacgagacc aaggacgcca tctccaaggt ggagaagtcc 300 

accgacaagt ccctcggcta caagctcgtg aacaccgaca agaagggccg ctaccgcatc 360 

accaaggaaa tcttcaccga cgtgaagcgc aactccctca tcatgaaggc caagttcgag 420 

gccctcgagg gctccatcca cgactacaag ctctacctcg cctacgaccc gcacatcaag 480 

aaccagggct cctacaacga gggctacgtg atcaaggcca acaacaacga gatgctcatg 540 

gccaagcgcg acaacgtgta caccgccctc tcctccaaca tcggctggaa gggctactcc 600 

atcggctact acaaggtgaa cgacatcatg accgacctcg acgagaacaa gcagatgacc 660 

aagcactacg actccgcccg cggcaacatc atcgagggcg ccgagatcga cctcaccaag 720 

aactccgagc tcgagatcgt gctctccttc ggcggctccg actccgaggc cgccaagacc 780 

gccctcgaga ccctcggcga ggactacaac aacctcaaga acaactacat cgacgagtgg 840 

accaagtact gcaacaccct caacaacctc aacggcaagg ccaactccct ctactacaac 900 

tccatgatga tcctcaaggc ctccgaggac aagaccaaca agggcgccta catcgcctcc 960 

etctecatcc cgcggggcga cggccagcgc gacgacaaca ccggcggcta ccacctcgcg 1020 

tggtcccccg acctctacca cgtggccaac gccttcatcg ccgccggcga cgtggactcc 1080 

qccaacccct ccctcgacta cctcgccaag gtggtgaagg acaacggcat gatcccgcag 1140 

aacaccccga tctccggcaa gccgtactgg acctccatcc agctcgacga gcaggccgac 1200 

ccgatcatcc cctcctaccg cctcaagcgc tacgacctct acgactccct cgtgaagccg 1260 

cicgccgact ucatcatcaa gatcggcccg aagaccggcc aggagcgctg ggaggagatc 1320 

ggcggctact ccccggccac gatggccgcc gaggtggccg gcctcacctg cgccgcctac 1380 

aucgccgagc agaacaagga ctacgagtcc gcccagaagt accaggagaa ggccgacaac 1440 

tggeagaagc tcatcgacaa cctcacctac accgagaacg gcccgctcgg caacggccag 1500 

cactacatcc gcatcgccgg cctctccgac ccgaacgccg acttcatgat caacatcgcc 1560 

aacggcggcg gcgtgtacga ccagaaggag atcgtggacc cgtccttcct cgagctggtg 1620 

cgccccggcg tgaagtccgc cgacgacccg aagatcctca acaccctcaa ggtggtggac 1680 

cccaccacca aggtggacac cccgaagggc ccgtcctggc atcgctacaa ccacgacggc 1740 

cacggcgagc cgtccaagac cgagctgtac cacggcgccg gcaagggccg cctctggccg 1800 

ctccccaccg gcgagcgcgg catgtacgag atcgccgccg gcaaggacgc caccccgtac 1860 

gtgaaggcga tggagaagct cgccaacgag ggcggcatca tctccgagca ggtgtgggag 1920 

gacaccggcc tcccgaccga ccccgcctcc ccgctcaact gggcccacgc cgagtacgtg 1980 

accctctccg cctccaacat cgagcacaag gcgctcgaca cgccggacat cgtgtac 2037 

<210> 49 
<2 1 1 > 579 
<212> PRT 

< 2 1 3 > Rhi20pus oryzae 
<400> 49 

Ala Ser Tie Pro Ser Ser Ala Ser Val Gin Leu Asp Ser Tyr Asn Tyr 
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ms n 


v a 1 


Asn 


Gly 


Val 


His 








7 0 n 


Leu Gly 


Ala 


Asp 






335 




Thr 


Tyr 


Ser 


Ser 




350 






Trp 


Val 


Ser 


Ser 


365 








Gly 


Val 


Ser 


Lys 


Leu Gly 


Ser 


Val 








& nn 


Leu 


Ala 


Thr 


Ala 






415 




lie 


Asn 


Lys 


Asn 




430 






Pro 


Glu 


Asp 


Thr 


445 








Phe 


Leu 


Ala 


Val 


Glu 


Trp 


lie 


Gly 
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*J / u 










A 1 ^ 










4 80 


Asn 


Gl y 


Gl y 


Va 1 


i h r 


va l 


be r 


be r 


T \ a 

lie 


be r 


Leu 


riO 


rfic 


flic 




T v ^ 

by 5 






4 o _> 










don 










4 95 
i > j 




Phe 


Asp 


I** . ~. 

oer 


Ser 


Ala 


i n r 


be r 


u i y 


L»y 5 


Ly b 


j yr 


Jill 


vol 




Thr 








500 










505 










510 






Asp 


Phe 


Asn 


Asn 


Leu 


Ala 


Gin 


Asn 


lie 


Ala 


Leu 


Ala 


Ala 


Asp 


Arg 


Phe 




515 










520 










525 








Leu 


Ser 


Thr 


Val 


Gin 


Leu 


His 


Ala 


His 


Asn 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 




530 










535 










540 








Thr 


•31 u 


Phe 


Asp 


Arg 


Thr 


Thr 


Gly 


Leu 


Ser 


Thr 


Gly 


Ala 


Arg 


Asp 


Leu 


5<15 






550 










555 










560 


Trp 


Ser 


His 


Ala 


Ser 


Leu 


lie 


Thr 


Ala 


Ser 


Tyr 


Ala 


Lys 


Ala 


Gly 


Ala 






565 










570 










575 





Pro Ala A) a 



< 2 1 0 > 50 
<211> 1737 
< 2 1 2 > DNA 

< 2 1 3 > Rhizopus ory2ae 

<400> 50 

gcctccatcc cgtcctccgc ctccgtgcag ctcgactcct acaactacga cggctccacc 60 

ttctccggca aaatctacgt gaagaacatc gcctactcca agaaggtgac cgtgatctac 

gccgacggct ccgacaactg gaacaacaac ggcaacacca tcgccgcctc ctactccgcc 

ccgatctccg gctccaacta cgagtaccgg accttctccg cctccatcaa cggcatcaag 

qagttctaca tcaagtacga ggcgcccggc aagacccact acgacaacaa ■ caactccgcc 

aactaccagq tgtccacccc caagccgacc accaccaccg ccaccgccac caccaccacc 360 

gccccgtcca cctccaccac caccccgccg tcccgctccg agccggccac cttcccgacc 420 

ggcaactcca ccauctcctc ccggaccaag aagcaggagg gcatctcccg cttcgccatg 

ctccgcaaca tcaacccgcc gggctccgcc accggcttca tcgccgcctc cctctccacc 

gcccgcccgg actaccacta cgcctggacc cgcgacgccg ccctcacctc caacgtgatc 600 

gugtacgagt acaacaccac cctctccggc aacaagacca tcctcaacgt gctcaaggac 

tacgcgacct tctccgtgaa gacccagtcc acctccaccg tgtgcaactg cctcggcgag 

ccgaagttca acccggacgc ctccggctac accggcgcct ggggccgccc gcagaacgac 

ggcccggccg agcgcgccac caccttcatc ctcttcgccg actcctacct cacccagacc 840 

aaggacgccc cctacgtgac cggcaccctc aagccggcca tcttcaagga cctcgactac 

atggcgaacg tgtggtccaa cggctgcttc gacctctggg aggaggtgaa cggcgtgcac 

ttccacaccc tcatggtgat gcgcaagggc ctcctcctcg gcgccgactt cgccaagcgc 

aacggcgact ccacccgcgc ctccacctac tcctccaccg cctccaccat cgccaacaaa 1080 

atctccucct tctggatgtc ctccaacaac tggatacagg tgtcccagtc cgtgaccggc 1140 

gocgtgtcca agaagggcct cgacgtgtcc accctcctcg ccgccaacct cggctccgtg 1200 

qacgacggct tcttcacccc gggctccgag aagatcctcg ccaccgccgt ggccgtggag 1260 

oactccttcg cctccctcta cccgatcaac aagaacctcc cgtcctacct cggcaactcc 1320 



120 
180 
240 
300 



480 
540 
600 
660 
720 
780 



900 
960 
1020 



atcggccgct acccggagga cacctacaac ggcaacggca actcccaggg caactcctgg 1380 

ttccccgcca tgaccggcta cgccgagctg tactaccgcg ccatcaagga gtggatcggc 1440 

aacgccggcc tgaccgcgtc ctccatctcc ctcccgttct tcaagaagtt cgactcctcc 1500 

gccacctccg gcaagaagta caccgtgggc acctccgact tcaacaacct cgcccagaac 1560 

atcgccctcg ccgccgaccg cttcctctcc accgtgcagc tccacgccca caacaacggc 1620 

tccctcgccg aggagttcga ccgcaccacc ggcctctcca ccggcgcccg cgacctcacc 1680 

tggtcceacg cctccctcat caccgcctcc tacgccaagg ccggcgcccc ggccgcc 1737 

<2 1 0> 51 
<2!1> 439 
< 2 i 2 > PF.T 

< 2 1 3 > Artificial Sequence 
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<220> 

<223> synthetic 



<400> 51 



Met 


Ala 


Lys 


His 


Leu 


Ala 


Ala 


Met 


Cys 


Trp Cys 


Ser 


Leu 


Leu 


Val 


Leu 


1 






5 










10 










15 




val 


Leu 


Leu 


Cys 


Leu 


Gly 


Ser 


Gin 


Leu 


Ala 


Gin 


Ser 


Gin 


Val 


Leu 


Phe 








20 








25 










30 






Gin 


Gly 


?he 


Asn 


Trp 


Glu 


Ser 


Trp 


Lys 


Lys 


Gin 


Gly 


» -v 

Gly 


Trp 


Tyr 


Asn 




35 










40 










4 5 








Tyr 


Leu 
50 


Leu 


Gly 


Arg 


Val 


Asp 
55 


Asp 


He 


Ala 


Ala 


Thr 
60 


Gly 


Ala 


Thr 


His 


Val 


Trp 


Leu 


Pro 


Gin 


Pro 


Ser 


His 


Ser 


Val 


Ala 


Pro 


Gin 


Gly 


Tyr 


Met 


65 








70 










75 










80 


Pro Glv 


Arg 


Leu 


Tyr 


Asp 


Leu 


Asp 


Ala 


Ser 


Lys 


Tyr 


Gly 


Thr 


His 


Ala 








8 5 










90 










95 




Giu 


Leu 


Lys 


Ser 


Leu 


Thr 


Ala 


Ala 


Phe 


His 


Ala 


* 

Lys 


Gly 


Val 


Gin 


Cys 






100 










105 










110 






Val 


TV \ - 

n, 1 a 


As D 

* 


Vai 


vai 


He 


Asn 


His 


Arg 


Cys 


Ala 


Asp 


Tyr 


Lys 


Asp Gly 






115 










1 20 










IOC 

125 








Arq 


Gly 


lie 


Tyr 


Cys 


Va 1 


Phe 


Glu 


Gly Gly 


Thr 


Pro 


Asp 


Ser 


TV v 

Mr g 


T r\ i \ 

be u 


130 










135 










i /i a 
1 4 U 










Asp 


Tro 

* 


Gly 


Pro 


Asp 


Met 


I ] e 


Cys 


Ser 


Asp 


ASp 


Thr 


Gin 


Tyr 


Cor 

gc r 


nsn 


145 






1 50 










lbb 










1 &C\ 
1 OU 


Gly Arq 


Gly 


His 


Arg 


Asp 


Th r 


Gly 


Ala 


Asp 


Phe 


Ala 


Ala 


Ala 


rro 










1 65 










170 










l / j 




lie 


Asp 


Ha s 


Leu 


Asn 


Pro 


Arg 


Val 


Gin 


Gin 




Leu 


Ser 


ASp 


Trp 


Lcll 






180 










185 










T Q A 






Asn 


Trp 


Leu 


Lys 


Ser 


Asp 


Leu 


Gly 


Phe 


Asp 


Gl y 


Trp 


Arg 


Leu 


Asp 


Phe 




195 










200 










205 








Ala 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ala 


Val 


Ala 


Lys 


va 1 


Tyr 


Val 


ASp 


Ser 


Thr 




210 






215 










"D O A 










Ala 


Pro 


Thr 


Phe 


Val 


Val 


Ala 


Glu 


He 


Trp 


Se r 


C ^ w- 

Ser 


Leu 


His 


Tyr 


Asp 


225 










230 










O 1 c 










240 


Gly 


Asn 


Gly 


Glu 


Pro 


Ser 


Ser 


Asn 


Gin 


Asp 


Al a 


Asp 


Arg 


Gin 


Glu 


Leu 






245 










250 










255 




Val 


Asn 


Trp 


Ala 


Gin 


Ala 


Va 1 


Gly Gly 


Pro 


Ala 


Ala 


Ala 


Phe 


Asp 


Phe 






260 










265 










OTA 






Thr 


Thr 


Lvs 


Gly 


Val 


Leu 


Gin 


Ala 


Ala 


Val 


Gin 


Gly 


Glu 


T A ■ • 

Leu 


Trp 


Arg 






275 








280 










IOC 

28 5 








Met 


Lvs 


Asp 


Glv 

« 


Asn 


Gly 


Lys 


Ala 


Pro 


Gly 


Met 


He 


Gly 


Trp 


Leu 


Pro 




290 








a a r 

2 95 










■5 Art 

300 










Glu 


Lys 


Ala 


Va I 


Thr 


Phe 


Va 1 


Asp 


Asn 


His 


Asp 


Thr 


Gly 


Ser 


Thr 


Gin 


305 








310 










315 










320 


Asp. 


Ser 


Trp 


Pro 


Phe 


Pro 


Ser 


Asp 


Lys 


Vai 


Met 


Gin 


Gly 


Tyr 


Ala 


Tyr 








325 










330 










335 




I le 


Leu 


Thr 


His 


Pro 


Gly 


Thr 


Pro 


Cys 


He 


Phe 


Tyr 


Asp 


His 


Val 


Phe 








340 








345 










350 






Asp 


Tro 

* 


Asn 


Leu 


Lys 


Gin 


Giu 


He 


Ser 


Ala 


Leu 


Ser 


Ala 


Val 


Arg 


Ser 


355 








360 










365 








Arg 


Asn 


Gly 


He 


H-s 


Pro 


Glv 


Ser 


Giu 


Leu 


Asn 


He 


Leu 


Ala 


Ala 


Asp 


370 








375 










380 










Gi v 


Asp 


Leu 


Tyr 


Val 


Ala 


Lys 


lie 


Asp Asp 


Lys 


Val 


He 


Val 


Lys 


lie 


365 






390 










395 










400 


Gly 


Ser 


Arg 


iyr 


Asp 


Val 


Gly 


Asn 


Leu 


He 


Pro 


Ser 


Asp 


Phe 


His 


Ala 




4 05 










410 










415 




Val 


.*■*. 1 a 


His 


Gly 
420 


Asn 


Asn 


Tyr 


Cvs 

m 


Val 
425 


Trp 


Glu 


Lys 


His 


Gly 
430 


Leu 


Arg 
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Val Fro Ala Glv Arg His His 
4 35 



<210> 52 
<2i!> 1320 
<2!2> DNA 

<2\2> Artificial Sequence 
<220> ■ 

< 2 2 3 > synthetic 
<400> 52 

acggcgaagc acttgqctgc catgtgccgg tgcagcctcc tagtgcttgt actgctctgc 60 

tcgggctccc agccggccca atcccaggtc ctcttccagg ggttcaactg ggagtcgtgg 120 

aagaagcaag gtgggtggta caactacctc ctggggcggg tggacgacat cgccgcgacg 180 

ggggccacgc acgtccggct cccgcagccg tcgcactcgg tggcgccgca ggggtacatg 240 

cccggccggc tctacgacct ggacgcgtcc aagtacggca cccacgcgga gctcaagtcg 300 

ctcaccgcgg cgctccacgc caagggcgtc cagtgcgtcg ccgacgtcgt gatcaaccac 360 

cgctgcgccg actacaagga cggccgcggc atctactgcg tcttcgaggg cggcacgccc 420 

gacagccgcc ccgactgggg ccccgacatg atctgcagcg acgacacgca gtactccaac 480 

gggcgcgggc accgcgacac gggggccgac ttcgccgccg cgcccgacat cgaccacctc 540 

aacccqcgcg cgcagcagga gctctcggac tggctcaact ggctcaagtc cgacctcggc 600 

ttcgacuqct ggcgcctcga cttcgccaag ggccactccg ccgccgtcgc caaggtgtac 660 

gtcgacagca ccqcccccac cttcgtcgtc gccgagatat ggagctccct ccactacgac 720 

ggcaacqgcg agccgtccag caaccaggac gccgacaggc aggagctggt caactgggcg 780 

caggcqcjtqg gcggccccgc cgcggcgttc gacttcacca ccaagggcgt gctgcaggcg 840 

gccgtccagg gcgacctqtg gcgcatgaag gacggcaacg gcaaggcgcc cgggatgatc 900 

ggctggctgc cggacaaggc cgtcacgttc gtcgacaacc acgacaccgg ctccacgcag 960 

aactcgtggc cattcccctc cgacaaggtc atgcagggct acgcctatat cctcacgcac 1020 

ccaggaactc catgcatctt ccacgaccac gctttcgact ggaacctgaa gcaggagatc 1080 

agcgcgctgt ctgcggtgag gtcaagaaac gggatccacc cggggagcga gctgaacatc 1140 

ctcgccgccg acggggatct ctacgtcgcc aagattgacg acaaggtcat cgtgaagatc 1200 

gggtcacggt acgacgtcgg gaacccgatc ccctcagact tccacgccgt tgcccctggc 1260 

aacaaccact gcgtttggga gaagcacggt ctgagagttc cagcggggcg gcaccactag 1320 



<210> 53 
<211> 45 
< 2 1 2 > PRT 

<213> Artificial Sequence 
<220> 

< 2 2 3> synthetic 
<<100> 53 

Ala Thr Glv Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly 

i 5 10 15 

Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr 

20 25 30 

Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val 
35 40 45 



<210> 54 

<21!> 13"? 

<212> DNA 

< 2 1 3 > Artificial Seouence 
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<220> 

< 2 2 3 > synthetic 
<400> 54 

gccaccggcg gcaccaccac caccgccacc accaccggct ccggcggcgt gacctccacc 60 
eccaagacca ccaccaccgc ctccaagacc tccaccacca cctcctccac ctcctgcacc 120 
accccgaccg ccgtgtc 137 

<210> 55 

Oil> 300 

< 2 1 2 > PRT 

< 2 1 3 > Pyrococcus furiosus 



<400> 55 



- J. c 


T v r 




Va 1 


G l u 


L v s 


T vr 


His 


Th r 


Ser 


Glu 


ASD 


Lvs 


Ser 


Thr 


Ser 


1 

* 






5 










10 










15 




A ^ n 

."Oil 


Thr 

4 . U 


Cpr 


•J C i 

20 


Thr 

I'll 


P ro 


F ro 


Gin 


Th r 
25 


Thr 


Leu 


Ser 


Thr 


Thr 
30 


Lvs 


Val 


' .P i } 


t-> y o 


I 1 e 


Arc 


Tvr 


Pro 


Asp 


Asp 


Glv 


Glu 


Trp 


Pro 


Glv 


Ala 


Pro 


lie 




35 










40 










45 








«cn 


I ,\* S 

l .< j o 


A S D 


G 1 v 


As D 


Glv 


Asn 


Pro 


Glu 


Phe 


Tvr 


He 


Glu 


He 


Asn 


Leu 


50 








55 










60 










T ro 


noil 




uc u 


Asn 

n j 1 1 


Ala 


Thr 


Glv 


Phe 

Lit w 


Ala 


Glu 


Met 


Thr 


Tvr 


Asn 


Leu 


65 










70 








75 










80 


Th y 


^1=- r 
jt i 




Va 1 

v a x 


I ,e u 


His 


Tvr 


Val 


Gin 


Gin 


Leu 


Asd 


Asn 


He 


Val 


Leu 








85 










90 










95 




A r n 


fi o 


A r a 


Ser 


As n 


Tro 


Val 


His 


Glv 
v / 


Tyr 


Pro 


Glu 


lie 


Phe 


Tyr 


Gly 




100 










105 










110 






A^n 


I ^ 


Pro 


Trn 


Asn 


Ala 


Asn 


Tvr 


Ala 


Thr 


Asp 


Gly 


Pro 


He 


Pro 


Leu 






115 








120 










125 








Pro 


Ser 


Lys 


Val 


Ser 


Asn 


Leu 


Thr 


Asp 


Phe 


Tyr 


Leu 


Thr 


lie 


Ser 


Tyr 




130 








135 










140 










Lys 


Leu 


Glu 


Pro 


Lys 


Asn 


Gly 


Leu 


Pro 


He 


Asn 


Phe 


Ala 


He 


Glu 


Ser 


145 










150 










155 










160 


Trp 


Leu 


Thr 


Arg 


Glu 


Ala 


Trp 


Arg 


Thr 


Thr 


Gly 


He 


Asn 


Ser 


Asp 


Glu 






165 










170 










175 




Gin 


Glu 


Vai 


Met 


lie 


Trp 


He 


Tyr 


Tyr 


Asp 


Gly 


Leu 


Gin 


Pro 


Ala 


Gly 








130 








185 










190 






Ser 


Lys 


Va 1 


Lys 


Glu 


lie 


Val 


Val 


Pro 


He 


He 


Val 


Asn 


Gly 


Thr 


Pro 




195 








200 










205 








Val 


Asn 
210 


Ala 


Thr 


Phe 


Glu 


Val 
215 


Trp 


Lys 


Ala 


Asn 


He 
220 


Gly 


Trp 


Glu 


Tyr 


Vai 


Ala 


Phe 


Arg 


lie 


Lvs 


Thr 


Pro 


lie 


Lvs 


Glu 


Gly 


Thr 


Val 


Thr 


He 


225 








230 










235 










240 


Pro 


Tyr 


Gly 


Ala 


Phe 


lie 


Ser 


Val 


Ala 


Ala 


Asn 


He 


Ser 


Ser 


Leu 


Pro 






245 










250 










255 




Asn 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Leu 


Glu 


Asp 


Val 


Glu 


He 


Gly 


Thr 


Glu 


Phe 






260 








265 










270 






Gl y 


Thr 


Pro 


Ser 


Thr 


Thr 


Ser 


Ala 


His 


Leu 


Glu 


Trp 


Trp 


He 


Thr 


Asn 




275 










280 










285 








i 1 e 


Thr 
290 


Leu 


Thr 


Pro 


Leu 


Asp 
295 


Arg 


Pro 


Leu 


He 


Ser 
300 











<210> 56 

<211> 903 

<212> DMA 

< 2 1 3 > Pyrococcus furiosus 
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<400> 36 

atctacttcg tggagaagta ccacaccccc gaggacaagt ccacctccaa cacctcctcc 60 

accccgccgc agaccaccct ctccaccacc aaggcgctca agatccgcta cccggacgac 120 

ggcgagtggc ccggcgcccc gatcgacaag gacggcgacg gcaacccgga gttctacatc 180 

gagatcaacc tctggaacat cctcaacgcc accggcttcg ccgagatgac ctacaacctc 240 

actagtggcg tgctccacta cgtgcagcag ctcgacaaca tcgtgctccg cgaccgctcc 300 

aaccgggtgc acggctaccc ggaaatcttc tacggcaaca agccgtggaa cgccaactac 360 

gccaccgacg gcccgatccc gcccccgtcc aaggtgtcca acctcaccga cttctacctc 420 

accatctcct acaagctcga gccgaagaac ggtcccccga tcaacttcgc catcgagtcc 480- 

tggctcaccc gcgaggcctg gcgcaccacc ggcaccaact ccgacgagca ggaggtgatg 540 

atctggatct actacgacgg cctccagccc gcgggctcca aggtgaagga gatcgtggtg 600 

ccgatcatcg tgaacggcac cccggtgaac gccaccttcg aggtgtggaa ggccaacatc 660 

ggctgggagt acgtggccct ccgcaccaag accccqatca aggagggcac cgtgaccatc 720 

ccgtacggcg ccttcatctc cgtggccgcc aacatctcct ccctcccgaa ctacaccgag 780 

aagcacctcg aggacgtgga gatcggcacc gagtccggca ccccgtccac cacctccgcc 840 

cacctcoagt ogtggatcac caacatcacc ctcaccccgc tcgaccgccc gctcatctcc 900 

tag ' 903 

<210> 57 
< 2 1 1 > 387 
<212> PRT 

<213> Thermus flavus 
<400> 57 

Met Tyr Glu Pro Lys Pro Glu His Arg Phe Thr Phe Gly Leu Trp Thr 

IS 10 15 

Val Asp Asn Val Asp Arg Asp Pro Phe Gly Asp Thr Val Arg Glu Arg 

20 25 30 

Leu Asp Pro Val Tyr Val Val His Lys Leu Ala Glu Leu Gly Ala Tyr 

35 40 45 

Gly Val Asn Leu His Aso Glu Asp Leu lie Pro Arg Gly Thr Pro Pro 

50 55 60 

Gin Glu Arg Asd Gin He Val Arg Arg Phe Lys Lys Ala Leu Asp Glu 
65 70 75 80 

Thr Val Leu Lys Val Pro Met Val Thr Ala Asn Leu Phe Ser Glu Pro 

85 90 95 

Ala Phe Arg Aso Gly Ala Ser Thr Thr Arg Asp Pro Trp Val Trp Ala 

100 105 110 

Tyr Ala Leu Arg Lys Ser Leu Glu Thr Met Asp Leu Gly Ala Glu Leu 

115 120 125 

Gly Ala Glu He Tyr Met Phe Trp Met Val Arg Glu Arg Ser Glu Val 

130 135 140 

Glu Ser Thr Asp Lys Thr Arg Lys Val Trp Asp Trp Val Arg Glu Thr 
145 150 155 160 

Leu Asn Phe Met Thr Ala Tyr Thr Glu Asp Gin Gly Tyr Gly Tyr Arg 

165 170 175 

Phe Ser Val Glu Pro Lys Pro Asn Glu Pro Arg Gly Asp He Tyr Phe 

130 185 190 

Thr Thr Val Gly Ser Met Leu Ala Leu lie His Thr Leu Asp Arg Pro 

195 200 205 

Glu Arg Phe Glv Leu Asn Pro Glu Phe Ala His Glu Thr Met Ala Gly 

210 215 220 

Leu Asn Phe Asp His Ala Val Ala Gin Ala Val Asp Ala Gly Lys Leu 
225 230 235 240 

Phe His He Asp Leu Asn Asp Gin Arg Met Ser Arg Phe Asp Gin Asp 

245 250 255 

Leu Arg Phe Gly Ser Glu Asn Leu Lys Ala Gly Phe Phe Leu Val Asp 

260 265 270 
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Leu Leu Glu Ser Ser Gly Tyr Gin Gly Pro Arg His Phe Glu Ala His 

275 280 285 

Ala Leu Arg Thr Glu Asp Glu Glu Gly Val Trp Thr Phe Val Arg Val 

290 295 300 

Cvs Men Arg Thr Tyr Leu Tie lie Lys Val Arg Ala Glu Thr Phe Arg 
305 310 315 320 

Glu Asp Pro Glu Val Lys Glu Leu Leu Ala Ala Tyr Tyr Gin Glu Asp 

325 330 335 

Pro Ala Thr Leu Ala Leu Leu Asp Pro Tyr Ser Arg Glu Lys Ala Glu 

340 345 350 

Ala Leu Lvs Arg Ala Glu Leu Pro Leu Glu Thr Lys Arg Arg Arg Gly 

355 360 365 

Tyr Ala Leu Glu Arg Leu Asp Gin Leu Ala Val Glu Tyr Leu Leu Gly 

210 375 380 

Vai Arg Glv 
335 



<210> 58 
<211> 97 8 
<2I2> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> synthetic 
<400> 58 

atggggaaga acggcaacct gtgctgcttc tctctgctgc tgcttcttct cgccgggttg 60 

acgcccggcc atcaaatcta cttcgtggag aagtaccaca cctccgagga caagtccacc 120 

tccaacacct cctccacccc gccgcagacc accctctcca ccaccaaggt gctcaagatc 180 

cgccacccg^ acgacggtga gtggcccggc gccccgatcg acaaggacgg cgacggcaac 240 

cccgagtcct acaccgagat caacctctgg aacatcctca acgccaccgg cttcgccgag 300 

atgacctaca acctcactag tggcgtgctc cactacgtgc agcagctcga caacatcgtg 360 

ctccgcgacc gctccaactg ggtgcacggc tacccggaaa tcttctacgg caacaagccg 420 

tggaacgcca actacgccac cgacggcccg atcccgctcc cgtccaaggt gtccaacctc 460 

accgacttct acctcaccat ctcctacaag ctcgagccga agaacggtct cccgatcaac 540 

ttcgccatcg agtcctggct cacccgcgag gcctggcgca ccaccggcat caactccgac 600 

gagcaggagg tgatgatctg gatctactac gacggcctcc agcccgcggg ctccaaggtg 660 

aaggagatcg tggtgccgat catcgtgaac ggcaccccgg tgaacgccac cttcgaggtg 720 

tggaaggcca acatcggctg ggagtacgtg gccttccgca tcaagacccc gatcaaggag 780 

ggcaccgtga ccatcccgta cggcgccttc atctccgtgg ccgccaacat ctcctccctc 

ccgaactaca ccgagaagca cctcgaggac gtggagatcg gcaccgagtt cggcaccccg 

cccaccacct ccgcccacct cgagtggtgg atcaccaaca tcaccctcac cccgct.cgac 
cgcccgctca tctcctag 

<2\0> 59 
<211> 1920 
<212> DNA 

< 2 1 3 > Aspergillus niger 
<400> 59 

atgtccttcc gctccctcct cgccctctcc ggcctcgtgt gcaccggcct cgccaacgtg 60 

atctccaagc gcgccaccct cgactcctgg ctctccaacg aggccaccgt ggcccgcacc 120 

gccatcctca acaacatcgg cgccgacggc gcctgggtgt ccggcgccga ctccggcatc 180 

gtggtggcct ccccgtccac cgacaacccg gactacttct acacctggac ccgcgactcc 240 

ggcctcgtgc tcaagaccct cgtggacctc ttccgcaacg gcgacacctc cctcctctcc 300 

accatcgaga actacatctc cgcccaggcc atcgtgcagg gcatctccaa cccgtccggc 360 

gacctctcct ccggcgccgg cctcggcgag ccgaagttca acgtggacga gaccgcctac 420 



840 
900 
960 
978 



Page 59 



WO 2005/096804 PCT/US2004/007182 



accggctcct ggggccgccc gcagcgcgac ggcccggccc tccgcgccac "cgcca tgatc 480 

ggccccggcc agtggcccct cgacaacggc tacacctcca ccgccaccga catcgtgtgg 540 

ccgctcgtgc gcaacgacct ctcctacgtg gcccagtact ggaaccagac cggctacgac 600 

ccctgggagg aggtgaacgg ctcctccttc ttcaccatcg ccgtgcagca ccgcgccctc 660 

gtggagggcc ccgccttcgc caccgccgtg ggctcctcct gctcctggtg cgactcccag 720 

gccccggaga tcctctgcta cctccagtcc ttctggaccg gctccttcat cctcgccaac 780 

ttcgactcct cccgctccgg caaggacgcc aacaccctcc tcggctccat ccacaccttc 840 

gacccggagg ccgcctgcga cgactccacc ttccagccgt gctccccgcg cgccctcgcc 900 

aaccacaagg aggtggtgga ctccttccgc tccatctaca ccctcaacga cggcctctcc 960 

gactccgagg ccgtggccgt gggccgctac ccggaggaca cctactacaa cggcaacccg 1020 

tggtccctct gcaccctcgc cgccgccgag cagctctacg acgccctcta ccagtgggac 1080 

aaqcagggct ccctcgaggt gaccgacgtg tccctcgact tcttcaaggc cctctactcc 1140 

gacgccgeca ccggcaccta ctcctcctcc tcctccacct actcctccat cgtggacgcc 1200 

gtoaagacct tcgccgacgg cttcgtgtcc atcgtggaga cccacgccgc ctccaacggc 1260 

tccatgtccg agcagtacga caagtccgac ggcgagcagc tctccgcccg cgacctcacc 1320 

tqgtcctacg ccgccctcct caccgccaac aaccgccgca actccgtggt gccggcctcc 1380 

cggggcgaga cctccgcctc ccccgtgccg ggcacctgcg ccgccacctc cgccatcggc 1440 

acctactcct ccgtgaccgt gacctcctgg ccgtccatcg tggccaccgg cggcaccacc 1500 

accaccgcca ccccgaccgg ctccggctcc gtgacctcca cctccaagac caccgccacc 1560 

gcctccaaga cctccacctc cacctcctcc acctcctgca ccaccccgac cgccgtggcc 1620 

gtgacctlcg acctcaccgc caccaccacc tacggcgaga acatctacct cgtgggctcc 1680 

atctcccagc tcggcgactg ggagacctcc gacggcatcg ccctctccgc cgacaagtac 1740 

acctcccccg acccgccccg gtacgtgacc gtgaccctcc cggccggcga gtccttcgag 1800 

tacaagttca tccgcatcga gtccgacgac tccgtggagt gggagtccga cccgaaccgc 1860 

gagtaeaccg tgccgcaggc otgcggcacc tccaccgcca ccgtgaccga cacctggcgc 1920 



<2)0> 60 

<21 )> 6 

<212> PRT 

< 2 1 3 > Artificial Sequence 



<220> 

<223> synthetic 
<400> 60 

Ser Glu Lys Asp Glu Leu 
1 5 



< 2 1 0 > 61 
<211> 561 

< 2 1 2 > ON A 

< 2 1 3 > Artificial Sequence 
<2?.0> 

<223> Xyianase BD7436 



<220> 

<221> CDS 

<222> (1) . . (561) 



<400> 61 

atg get age acc ttc tac tgg cat ttg tgg acc gac ggc ate ggc acc 48 
Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly lie Gly Thr 
15 10 15 
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gtg aac get acc aac ggc age gac ggc aac cac age gtg age tgg age 96 

Vai Asn A.!a Thr Asn Civ Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac tec gtg gtg ggc aag ggc tgg acc acc ggc age get 144 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

acc agg gtg ate aac tac aac get cat get ttc age gtg gtg ggc aac 192 

Thr Arg Val He Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

get tac ctg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 240 

Ala Tvr Leu Ala Leu Tyr Giy Trp Thr Arg Asn Ser Leu He Glu Tyr 

65 ~>0 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 288 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 ' 90 95 

ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 

Glv Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

agg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 384 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 

He Thr ?he Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 1 SO 155 160 

ctg ggc age age egg age tac caa gtg ttg get acc gag ggc tac caa 528 

Leu Gly Ser Ser Tro Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 HO 175 

age age ggc tac age aac gtg acc gtg tgg tag 561 

Ser Ser Glv Tvr Ser Asn Val Thr Val Trp 

185 





180 


< 2 1 0 > 


62 


<2H> 


186 


<212> 


PR? 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


62 



Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly lie Gly Thr 
15 10 15 
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Val Asn Ala Thr Asn Glv Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 



Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 4 0 4 5 



Thr Arg Val lie Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 



Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 ' 75 80 



Tvr Val Val Asp Ser Trp Gly Thr Tvr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 



Glv Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 



Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 



Phe Trp Ser Val Arg Gin Ser Lys Arg Pro lie Gly Thr Asn Asn Thr 
130 135 140 



lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 



Leu Glv Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 17 5 



Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

160 185 



<210> 63 

< 2 1 1 > 561 

< 2 1 2 > DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Xyianase BD6002A 



<220> 

<221> CDS 

<222> i\\ . . (561) 
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<400> 63 

atg get sec acc gac tac tgg caa aac tgg acc gac ggc ggc ggc acc 48 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 

i 5 10 15 

gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 

Vai Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tec ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 

Asn Cvs Giv Asn Fhe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35* 40 45 

acc agg gtg ate aac tac aac get ggc get ttc age cca age ggc aac 192 

Thr Arg Vai lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
SO 55 60 

ggc tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 240 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 288 

Tvr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 

Glv Thr Val Thr Ser Asp Giv Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

aag acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 384 

Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro lie Gly Thr Asn Asn Thr 
130 135 140 

ate acc etc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 

lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 528 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

age age ggc tac age aac gtg acc gtg tgg tag 561 

Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

160 185 



<210> 64 

<211> 166 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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<400> 64 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
IS 10 15 



Val Asn Ala Thr Asn Gly Ser Asd Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 



Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
25 40 45 



Thr Arg Val lie Asn Tvr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 

50 5 5 60 



Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 ' 70 ' 75 80 



Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 



Gly Thr Val Thr Ser Asp Glv Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 ' 110 



Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
US 120 125 



Phe Trp Ser Val Arg Gin Ser Lys Arg Pro lie Gly Thr Asn Asn Thr 
130 135 140 



Tie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 



Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



Ser Se 


r Gly Tyr Ser Asn Va 




130 


<2)0> 


65 


<21 1> 


561 


<2)2> 


ON A 


<213> 


Artificial Sequence 


<220> 




<223> 


Xylanase BD6002B 



135 
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<220> 

<22l> CDS 

<2?2> ( 1 ) . . (561 ) 

<400> 65 

acq gec tec acc gac tac tgg cag aac tgg acc gac ggc ggc ggc acc 48 

Mec Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 

) 5 10 IS 

gcg aac gec acc aac ggc tec gac ggc aac tac tec gtg tec tgg tec 96 

Vai Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc tec gec 144 

Asn Cys Glv Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

acc cgc gtg ate aac tac aac gee ggc gee ttc tec ccg tec ggc aac 192 

Thr Arq Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 ' 55 60 

ggc tac etc gee etc tac gee tgg acc cgc aac tec etc ate gag tac 240 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 

65 70 ' 75 80 

tac gtg gtg gac tec tgg ggc acc tac cgc ccg acc ggc acc tac aag 288 

Tvr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

65 90 95 

ggc acc gtg acc tec gac ggc ggc acc tac gac ate tac acc acc acc 336 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

cgc acc aac gee ccg tec ate gac ggc aac aac acc acc ttc acc cag 384 

Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg tec gtg cgc cag tec aag cgc ccg ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc tec aac cac gtg aac gec tgg aag tec aag ggc atg aac 480 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 150 155 160 

etc ggc tec tec tgg tec tac cag gtg etc gec acc gag ggc tac cag 528 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

tec tee ggc tac tec aac gtg acc gtg tgg tga 561 

Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 66 
<2\l> 186 
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<212> PRT 

<2.13> Arcificial Sequence 

<220> 

<22 3> Synt.net; ic Construct 
<400> 66 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 



Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 



Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

Thr Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50~ 55 60 

Glv Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 10 75 80 

Tyr val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 



Arg Thr Asn Ala Pro Ser lie Asp Gly Asn Asn Thr Thr Phe Thr Gin 

115 120 125 

°he Tro Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 

130 135 140 

n e Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

:*s 150 155 160 

Leu Giy Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 * 170 175 



Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 67 
<2H> 207 1 
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< 2 1 2 > DNA 

< 2 1 3 > Orvza sativa 

<220> 

< 2 2 1 > .Tusc_f eat ure 

< 2 2 2 > (1 ) . . (2071) 

< 2 2 3 > Promoter 

<400> 67 



tcca tgctgt 


cct actact t 


get tcatccc 


ct tctacat t 


t tgt tctggt 


1 1 1 tggcctg 


60 


cat c t cqga t 


catgatgtat 


gtgat ttcca 


atetgetgea 


ata tgaa tgg 


agactctgtg 


120 


c t a r » c e a t c a 


acaaca t gaa 


a tget tat ga 


ggcett tget 


gagcagecaa 


tettqectgt 


180 


dictate tct 


tcacaqgccg 


aa t tcctctg 


tt ttgttttt 


caccctcaat 


a tttggaaac 


240 


all tatctaq 


gttgtttgtg 


tccaggccta 


taaa teat ac 


a tgatgttgt 


cgtat tggat 


300 


atqaa cqtgq 


tgqcgtqt t c 


agtgcct t gg 


atttgagttt 


gatgagagt t 


get tctgggt 


360 


caccac tcac 


cat ta tcaa t 


gctcctct tc 


agca taaggt 


aaaagtct tc 


cctgtttacg 


420 


ttat 1 1 tacc 


cacta tggt t 


get tgggt tg 


gttt tttcct 


gattget tat 


gccatggaaa 


480 


qtcatttqat 


a tqt tqaact 


tgaa t taact 


qtagaattgt 


at acatgt tc 


catttgtgt t 


540 


q tact t cct t 


ct C t cct at t 


agtagectea 


gatgagtgtg 


aaaaaaacaq 


attata taac 


600 


t tgeccta t a 


aatcat t tqa 


aaaaaata tt 


gtacagtgag 


aaattga tat 


a tagtgaat t 


660 


tttaaqaqca 


tqt 1 1 tccta 


aagaag t a ta 


tattttctat 


gtacaaaggc 


ca t tgaagta 


720 


attgeagata 


caqga taa tg 


tagacttt tt 


ggacttacac 


tgetaect 1 1 


aagtaacaat 


780 


ca tgagcaa t 


agtgt tgcaa 


tgatatt tag 


getgeat teg 


tttactctct 


tgattt ccat 


840 


gagcacget t 


cccaaactgt 


taaactctgt 


gtttt ttgee 


aaaaaaaaat 


gca taggaaa 


fl A A 


qttacttcta 


aaaaa tcata 


tcaa tccatt 


ttttaagtta 


tagctaa tac 


t taat taatc 


960 


acgcgctaac 


aagtcactct 


gtt t ttcgta 


ctagagaga t 


tgtt t tgaac 


cagcactcaa 


1020 


gaacacagcc 


t taacccagc 


caaa taatgc 


tacaacctac 


cagtccacac 


ctcttgtaaa 


1080 


gcatt tgt tg 


catggaaaag 


ctaaga tgac 


agcaacctgt 


tcaggaaaac 


aactgacaag 


1140 


ctcac aggga 


qagggagct t 


t tggaaaggt 


geegt gcagt 


tcaaacaatt 


agt tagcagt 


1200 


agggecttgg 


t ttttgetea 


cagcaataag 


aagt taatca 


tggtgtaggc 


aacccaaata 


1260 


aaacaccaaa 


acatqeacaa 


ggeagt t tgt 


tgtat tctgt 


agt acaqaca 


aaactaaaag 


1320 


taa tgaaaqa 


aqa tgtggtg 


t tagaaaagg 


aaacaa tatc 


atgagtaatg 


tgtgggcat t 


1380 


atqqqaccac 


gaaa taaaaa 


gaacatt ttg 


atgagtcgtg 


tatcctcgat 


gagectcaaa 


1440 
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agttctctca 


ccccgga taa 


gaaaccctta 


agcaatgtgc 


aaagtttgca 


ttct ccactg 


1500 


aca tad tgca 


aaa taaga ta 


tea tcga tga 


catagcaact 


catgeatcat 


atcatgcctc 


1560 


tctcaacct a 


ttca t t ccta 


ctcacctaca 


taagta tct t 


cagctaaatg 


ttagaacata 


1620 


aaccca taag 


teaegt t tga 


tgagtattag 


gcgtgacaca 


tgacaaatca 


cagactcaag 


1680 


caaga t aaag 


caaaatgatg 


tgtacat aaa 


actccagagc 


tatatgtcat 


a t tgcaaaaa 


•1740 


qaggagagct 


t a taagacaa 


ggca tgactc 


acaaaaattc 


atttgeett t 


cgtgtcaaaa 


1800 


agaggagggc 


tttacat tat 


ccatgtcata 


t tgeaaaaga 


aagagagaaa 


gaacaacaca 


1860 


a tgctgcgtc 


aat tacaeat 


atctgta tgt 


ccatcat tat 


tea tccacct 


t tcgtgtacc 


1920 


acact tea ta 


tatcatgagt 


cact tea tgt 


ctggacatta 


acaaactcta 


tcttaaca 1 1 


1980 


t aga tgcaag 


agect t tatc 


tcactat aaa 


tgcacgatga 


tttctcattg 


t ttctcacaa 


2040 


aaagcat tea 


gttcat tagt 


cctacaacaa 


c 






2071 



<210> 68 

<211> 79 

<2 1 2 > PRT 

<?!3> Zea mays 



<220> 

<221> SIGNAL 

<2?2> (1) . . (79) 

< 2 2 3 > Maize waxy signal sequence . 

<400> 68 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
1 5 10 15 



Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 25 30 



Leu Arg Giy Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 



Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 
50 " 55 60 



Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala 
65' 70 75 



<210> 69 
<2!1> 1003 
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< 2 1 2 > DNA 

< 2 1 3 > Artificial Sequence 
<220> 

< 22 3 > Synthetic Bromelain Sequence 



<220> 

< 2 2 i > CCS 

<222> (1) . . (1005) 

<2?3> Synchetic 3romelain 

<<!00> 69 

aco gee egg aag gtg cag gtg gtg tec etc ttc etc ttc etc tgc gtg 48 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
IS 10 15 

acq tgg gec tec ccg tec gee gee tec gcg gac gag ccg tec gac ccg 96 
Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 

atg atg aag cgc ttc gag gag tgg atg gtg gag tac ggc cgc gtg tac 144 
Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 

aag gac aac g3c gag aag atg cgc cgc ttc cag ate ttc aag aac aac 192 
Lvs Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 
50 * 55 60 

gtg aac cac ate gag acc ttc aac tec cgc aac gag aac tec tac acc 240 
Val Asn His lie Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
55 70 75 80 

etc ggc ate aac cag ttc acc gac atg acc aac aac gag ttc ate gee 288 
Leu Gly lie Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe lie Ala 

85 90 95 

cag tac acc ggc ggc ate tec cgc ccg etc aac ate gag cgc gag ccg 336 
Gin Tyr Thr Gly Glv lie Ser Arg Pro Leu Asn He Glu Arg Glu Pro 

100 * 105 110 

gtg gtg tec ttc gac gac gtg gac ate tec gee gtg ccg cag tec ate 384 
Val Val Ser Phe Asp Asp Val Asp lie Ser Ala Val Pro Gin Ser He 
115 120 125 

gac tgg cgc gac tac ggc gec gtg acc tec gtg aag aac cag aac ccg 432 
Asp Trp Arg Aso Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 * * 135 140 

tgc ggc gec tgc tgg gee ttc gee gee ate gec acc gtg gag tec ate 480 
Cys Gly Ala Cys Trp Ala Phe Ala Ala lie Ala Thr Val Glu Ser He 
145 150 155 160 

tac aag ate aag aag ggc ate etc gag ccg etc tec gag cag cag gtg 528 
Tyr Lys He Lys Lys Gly lie Leu Glu Pro Leu Ser Glu Gin Gin Val 

165 170 175 

etc gac tgc gee aag ggc tac ggc tgc aag ggc ggc tgg gag ttc cgc 576 
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Leu Asp Cvs Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

h ' 130 185 190 

gcc tec gag tec ate ate tec aac aag ggc gtg gec tec ggc gec ate 

Ala Phe Glu Phe lie He Ser Asn Lys Gly Val Ala Ser Gly Ala lie 

3 95 200 205 

tec ccg uac aag gcc gcc aag ggc acc tgc aag acc gac ggc gtg ccg 

Tyr »rc Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 

210 215 220 

aac tec gcc tac ate acc ggc tac gcc cgc gtg ccg cgc aac aac gag 

Asn Ser Ala Tyr lie Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 

225 ' 230 235 240 

tec tec atg atg tac gcc gtg tec aag cag ccg ate acc gtg gcc gtg 

5e- Ser Met Met Tyr Ala Val Ser Lys Gin Pro lie Thr Val Ala Val 

245 250 255 

gac gcc aac acc aac etc cag cac tac aag tec ggc gtg ttc aac ggc 

Asc Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 

ccg vac gac acc tec etc aac cac gcc gtg acc gcc ate ggc tac ggc 

Pro Cys Gly Thr Ser Leu Asn His Ala Val Thr Ala lie Gly Tyr Gly 



280 285 



cag gac tec ate ate tac ccg aag aag tgg ggc gcc aag tgg ggc gag 

Gin Asp Ser lie lie Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 

290 295 300 

acc ggc tac ate cgc atg gcc cgc gac gtg tec tec tec tec ggc ate 

Ala GW Tyr lie Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly He 

205 310 315 320 

tgc ggc ate gcc ate gac ccg etc tac ccg acc etc gag gag tag 

rvc qu> He Ala lie Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

325 ' 330 



<210> 


70 


<211> 


334 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


10 



Met Ala Tro Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
V * " 5 10 15 



Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 



624 



672 



720 



768 



816 



864 



912 



960 



1005 
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Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 



Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 
50 * 55 60 



Val Asn His lie Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 



Leu Gly lie Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe lie Ala 

85 90 95 



Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn He Glu Arg Glu Pro 

100 105 110 



Val val Ser Phe Asp Asp Val Asp lie Ser Ala Val Pro Gin Ser He 
115 120 125 



Asp Trp Arg Aso Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 

130 ' 135 140 

Cys Gly Ala Cys Trp Ala Phe Ala Ala lie Ala Thr Val Glu Ser He 

H5 150 155 160 

Tyr Lys lie Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 

165 170 175 



Leu Aso Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 * 185 190 



Ala Phe Glu Phe He He Ser Asn Lys Gly Val Ala Ser Gly Ala He 
195 200 205 



r vr Pro Tyr Lys Ala Ala Lvs Gly Thr Cys Lys Thr Asp Gly Val Pro 

' 2)0 215 220 

Asn Ser Ala Tyr He Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 

225 * 230 235 240 

Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro He Thr Val Ala Val 

245 250 255 



Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 



Page 71 



WO 2005/096804 



PCTAJS2004/007182 



Pro Cvs Gly Thr Ser Leu Asn His Ala Val Thr Ala lie Gly Tyr Gly 
* 215 280 285 

Gin Asp Ser lie lie Tvr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
290 295 300 

Ala Glv Tvr He Arq Met Ala Arg Asp Val Ser Ser Ser Ser Gly lie 
305 310 315 320 

Cvs Glv lie Ala He Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

325 330 



<2I0> 71 
<2ii> 7S 
< 2 1 2 > DMA 

<213> Artificial Sequence 
<220> 

<223> Bromealin signal sequence 
<400> 71 

atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tctgcgtgat gtgggcctcc 
ccgcccgccg cccccgcc 



<210> 72 

<2H> 26 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Bromealin signal peptide 

<400> 72 

Met Ala Trp Lys Val Gin Vai Val Phe Leu Phe Leu Phe Leu Cys Val 



5 



10 is 



Met Trp Ala Ser Pro Ser Ala Ala Ser Ala 

20 25 



<210> 


73 


< 2 1 1 > 


1050 


<212> 


DMA 


<213> 


Art i f 


<220> 




< 2 2 3 > 


oSYNi 

■ 



60 
78 
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<400> 73 
atggcc tgga 


aggt gcagg l 


ggug<- ttLtt. 


ccgt ccgccg 


ccc ccgcgga 


eg a gv_eg tec 


a t ggt gQagc 


a egg ccg eg c 


g t dCaoy ydc 


c t ^aagaaca 


a eg i ga a cc a 


Ca CCyayaCC 


t cggca l ca 




tyova l y a c c 


gg ca t c t c~c 


gCCCgC t Cd a 


t" r* f~\ ^ t~\ r" r> r~" 

Ca Lcyatjtyt 


at cz ccgccg 


cgccgcag lc 


ca c ega e egg 


aaccagaacc 


cgcgcggcgc 


ctgetgygee 


tacaagat ca 


aga agggca t 




a agggc t acg 


geegcaaggg 




a agggcg t gg 


.-^ >^ f- /-^ /-^ /*i n 

lCl ccggcgi- 


Ca LCCacccy 


gacggcgt gc 


eg a a c iccgc 


CLaCaLCaCC 


tcctcca tga 


eg t aegecg c 


gcccaagcag 


aact tccagt 


actacaagtc 


cggcgtgttc 


gccgt gaccg 


ccatcggct a 


cggccaggac 


aagtggggcg 


aggceggcta 


ca teegcatg 


igcggcatcg 


cca tcgaccc 


gccctacccg 


occoccaact 


ccaccctcgt 


ggecgagtag 



<210> 

<21i> 1067 
< 2 1 2 > DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN11589 
<400> 74 

tggcctggaa ggtgcaggtg gtgtccccct 
cgtccgccgc ctccgccccc tcctcctccc 
ccgaccgcgc cgcctccacc gaegagcegt 
ggacggcgga gcacggccgc gtgtacaagg 
tcttcaagaa caaegtgaac cacatcgaga 
ccctcggcac caaccagctc accgacatga 



ttcctcttcc tetgegtgat gtgggcctcc 60 

gacccgatga tgaagegect cgaggagtgg 120 

aacgacgaga agatgegecg cttccagatc 180 

ttcaactccc gcaacgagaa ctcctacacc 240 

aacaacgagc tcatcgccca gcacaccggc ' 300 

gagccggtgg tgtccttcga cgacgtggac 360 

cgcgactacg gcgccgtgac ctccgtgaag 420 

ttcgccgcca tcgccaccgt ggagtccatc 480 

ctctccgagc ageaggtget cgactgcgcc 540 

ttccgcgcct tcgagttcat catctccaac 600 

tacaaggecg ccaagggcac ctgcaagacc 660 

ggctacgccc gcgtgccgcg caacaacgag 720 

ccgatcaccg tggccgtgga cgccaacgcc 780 

aacggcccgt gcggcacctc cctcaaccac 840 

tccatcatct acccgaagaa gtggggcgcc 900 

gcccgcgacg tgtcctcctc ctccggcatc 960 

accctcgagg aggtgttcgc cgaggccatc 1020 

1050 



tcctcttcct ctgcgtgatg tgggcctccc 60 

ccgccgactc caacccgatc cgcccggtga 120 

ccgacccgat gatgaagege ttcgaggagt 180 

acaacgacga gaagatgege cgcttccaga 240 

cccccaactc ccgcaacgag aacccctaca 300 

ccaacaacga gttcatcgcc cagtacaccg 360 
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gcggca tctc 


ccgcccgctc 


aaca tcgagc 


gcgagccggt 


ggtgtccttc 


gacgacgtgg 


420 


aca tcz ccgc 


cgcgccgcag 


tcca ccgact 


ggegegacta 


cggcgccgtg 


acctccgtga 


480 


aqaaccaqaa 


cccg tgegge 


gcctgctggg 


cct tcgccgc 


ca tcgccacc 


gtggagtcca 


540 


tctacaaga t 


caagaagggc 


a tcctcgagc 


cgctctccga 


gcagcaggtg 


ctcgactgcg 


600 


ccaagggcta 

j .j j 


egge tgcaag 


ggcggctggg 


agt tccgcgc 


ct tcgagttc 


atcatctcca 


660 


acaaqagcgt 


qgect ccgac 


geca tctacc 


egtacaagge 


cgccaagggc 


acctgeaaga 


720 


ccgacggcgt 


accgaactcc 


gect aca t ca 


cegget aege 


ccgcgcgccg 


cgcaacaacg 


780 


agcccc ccat 


gatgcacgcc 


gcgtccaagc 


ageega tcac 


cgtggccgtg 


gacgccaacg 


840 


ccaac~ tcca 


gcaccacaag 


cccggcgtgt 


tcaacggccc 


gt gcggcacc 


tccct caacc 


900 


acgccg cgac 


cgeca tegge 


taeggecagg 


actccatcat 


ctacccgaag 


aagcggggcg 


960 


ccaag tgggg 


cgaggccggc 


tacatccgca 


tggcccgcga 


cgtgtcctcc 


tcctccggca 


1020 


tctgcggca t 


cgccatcgac 


ccgctctacc 


cgaccctcga 


ggagtag 




1067 


< 2 1 0 > 75 

< 2 1 1 > 1023 
<2i2> DNA 

<213> Artificial Sequence 










<220> 

<223> pSYN11587 Sequence 










<4 00> 7 5 
atggcctgga 


aggtgcaggt 


ggtgttcctc 


ttcctcttcc 


tetgegtgat 


gtgggcctcc 


60 


ccgt ccgccg 


cctccgcgga 


cgagccgtcc 


gacccgatga 


tgaagegett 


cgaggagtgg 


120 


a tgct ggagt 


acggccgcgt 


gtacaaggac 


aacgacgaga 


agatgegecg 


cttccagatc 


180 


c ccaagaaca 


acq tgaacca 


ca tcgagacc 


t tcaactccc 


gcaacgagaa 


ctcctacacc 


240 


ctcggcatca 


accagi Lcac 


cgacatgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


300 


ggcac ccccc 


occcacccaa 


cat egagege 


gagccggtgg 


tgtcct tcga 


cgacgtggac 


360 


a tec ccgccg 


z geegcag tc 


ca tcgactgg 


mm t aca 


acaccat aac 


ct ccq taaaq 


420 


aac-zaqaacc 


cgcgcggcgc 


ctgctgggcc 


cccgccgcca 


tcgccaccgt 


ggagtcca tc 


480 


zacaaqa c ca 


agaagggcac 


cct egagecg 


ctctccgagc 


ageaggtget 


cgactgcgcc 


540 


aagggccacg 


getgeaaggg 


eggctgggag 


t tccgcgcct 


tcgagttcat 


catctccaac 


600 


aagggcgtgg 


cccccggcgc 


catctacccg 


tacaaggecg 


ccaagggcac 


ctgcaagacc 


660 


gacggcgcgc 


cgaactccgc 


ctacatcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 



Page 74 



WO 2(105/096804 PCTYUS2004/007182 



780 
840 
900 
960 



tcctccacga tgtacgccgt qtccaagcag ccgatcaccg tggccgtgga cgccaacgcc 

aactzccagt accacaagcc cggcgtgttc aacggcccgt gcggcacctc cctcaaccac 

gccgtgaccg ccatcggcta cggccaggac tccatcatct acccgaagaa gtggggcgcc 

aagtgqggcq aggccggcta cacccgcatg gcccgcgacg tgtcctcctc ctccggcacc 

tgcggcatcg ccatcgaccc gctctacccg accctcgagg agtccgagaa ggacgagctg 1020 
tag 1023 

<210> 76 

< 2 1 1 > 990 

< 2 1 2 > DNA 

<21 3> Artificial Sequence 
<220> 

<222> pSYN12169 Sequence 



«cgaqggtgt 


tgctcgt tgc 


cctcgct etc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 


qcggacqagc 


eg t ccgaccc 


gatgatgaag 


cget tcgagg 


agtggatggt 


ggagtaegge 


120 


cgcq t ataca 


acgacaacga 


cgagaagatg 


cgccgct tec 


agatctt caa 


gaacaacgtg 


180 


aaccaca teg 


agacct tcaa 


c t cccgcaac 


gagaactcct 


acaccctcgg 


catcaaccag 


240 


t tcaccgaca 


tgaccaacaa 


cgagt t catc 


gcccagtaca 


ccggcggca t 


ctcccgcccg 


300 


ctcaacatcg 


agcgcgagcc 


ggtggtgtcc 


ttcgacgacg 


tggacatctc 


cgccgtgccg 


360 


cactccatcg 


actggegega 


ctacggcgcc 


gtgacctccg 


tgaagaacca 


gaacccgtgc 


420 


qgcgcct get 


gggect lege 


cgeca tegee 


accgtggagt 


ccatctacaa 


ga tcaagaag 


480 


ggca tccccg 


agecget etc 


cgagcageag 


gtgetcgact 


gcgccaaggg 


ctacggctgc 


540 


aagggcgqct 


gggagt tccg 


cgect tcgag 


ttcatcatct 


ccaacaaggg 


cgtggcctcc 


600 


ggegecat ct 


acccgtacaa 


ggccgccaag 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


660 


tcegcctaca 


tcaccggcta 


cgcccgcgtg 


ccgcgcaaca 


acgagtcctc 


catgatgtac 


720 


gccgtgtcca 


ageagcega t 


caccgtggcc 


gtggacgcca 


acgccaact t 


ccagtactac 


780 


aagtccggcg 


tgt teaaegg 


cccgtgcggc 


acctccctca 


accacgccgt 


gaccgccatc 


840 


ggccacggcc 


aggactccat 


catct acccg 


aagaagtggg 


gcgccaagtg 


gggegaggee 


900 


gectacat cc 


gca tggcccg 


cgacgtgtcc 


tcctcctccg 


geatctgegg 


catcgccatc 


960 


gacccgccct 


acccgacccc 


cgaggagtag 








990 
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<210> 77 

<2 1 1 > 1170 

<212> DNA 

<213> Artificial Sequence 
<220> 



<223> pSYN12575 Sequence 










<400> 11 
atqctggcgg 


ctctggccac 


gtcgcagctc 


g tcgcadcgc 


gcgccggcc l 


y y y u y l <- <— »^y 


60 


qacqcgtcca 


cgt t ccqccg 


cggcgccgcg 


cagggee t ga 


ggggggcccg 


yy uy \- <-.y y uy 


120 

J- €m W 


gcoqcgaaca 


cgctcagca t 


gcggaccagc 


gcgcgcgcgg 






180 


caqacqcccc 


gcggggccaq 


gt tcccgtcg 






raacocca to 


240 


qcaqacgagc 


cgtccgaccc 


ga tgatgaag 


/*» n t* t* ^ n rt n 

cgc l l cgagy 


ant* nna t* nnf 
ay i- yy « l y y u 


\J w w V Uwx^^^ 


300 


eqcg t gtaca 


aqgacaacga 


cgagaaga tg 




a w a \» <w v. t w » w 


aaacaacQ to 


360 


aaccaca ccg 


agacct tcaa 


ctcccgcaac 




flrflrrrt 1 ran 


cat caaccaa 


420 


t tcaccgaca 


tgaccaacaa 


cgagt tea tc 




f raaraQcat 

VJ U W U W ^- U V. 


c t cccqccca 


480 


ct caacatcg 


agcgcgagcc 


ggt ggtgtcc 




f nnaratctc 


caccq tqeco 


540 


cagtcca teg 


actggegega 


ccacggcgcc 






aaaccca t ac 


600 


ggcgcctgct 


gggect t cgc 


cgccatcgcc 


accg tggag l 


l_ t_ Cl LL t o L, O u 




660 






caaqcaqcaq 


nf nrtroact 

y L y ^— itsjOvv 


acaccaa aac 


ctacqqctqc 


720 


aagggegget 


gggagttccg 


cgccttcgag 


ttcatcatct 


ccaacaaggg 


cgtggcctcc 


*? a f\ 


ggegeca tct 


acccgtacaa 


ggccgccaag 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


840 


tccacctaca 


tcaccggcta 


cgcccgcgtq 


ccgcgcaaca 


acgagtcctc 


catgatgtac 


900 


gccgtgtcca 


ageagcega t 


caccg tggee 


gtggacgcca 


acgccaact t 


ccagtact ac 


960 


aagtecggcg 


tgt tcaa egg 


cccqt gegge 


acctccctca 


accacgccgt 


gaccgcca tc 


1020 


gqctaccocc 


aggact cca t 


catctacccg 


aagaagtggg 


gcgccaagtg 


gggegaggee 


1080 


ggctacatcc 


gca tggcccg 


cgacgtgt cc 


tcctcctccg 


gca tetgegg 


catcgccatc 


1140 


gacccgctct 


acccgaccct 


cgaggagt ag 








1170 



<2i0> 78 

<211> 1068 

c2)2> DNA 

<213> Artificial Sequence 
<220> 

<223> pSM270 Sequence 
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<400> 78 
a tggcctgga 


agg tgcaggt 


gg tgc tcctc 


t tcctcttcc 


tctgcgtgat 


gtgggcctcc 


60 


ccgtccgccg 


cct ccgcc t c 


ctcctcctcc 


t tcgccgact 


ccaacccgat 


ccgcccggtg 


120 


accgaccgcc 


ccgcctccac 


cgacgagccg 


tccgacccga 


tga tgaagcg 


ct tcgaggag 


180 


tggatggtgg 


ag t a cggccg 


cgcgtacaag 


gacaacgacg 


agaagatgcg 


ccgct tccag 


240 


a tct tcaaga 


acaacgtgaa 


ccaca tcgag 


accttcaact 


cccgcaacga 


gaactcctac 


300 


accctcggca 


tcaaccagt t 


caccgacatg 


accaacaacg 


agt tcatcgc 


ccagtacacc 


360 


ggcggcatct 


cccgcccgct 


caacat cgag 


cgcgagccgg 


tggtgtcct t 


cgacgacgtg 


420 


gacatctccg 


ccg tgccgca 


gtccatcgac 


tggcgcgact 


acggcgccgt 


gacct ccgtg 


480 


aagaaccaga 


acccgtgcgg 


cgcctgctgg 


gccttcgccg 


ccat cgccac 


cgtggagtcc 


540 


a tctacaaga 


ccaagaaggg 


catcctcgag 


ccgctctccg 


agcagcaggt 


gctcgactgc 


600 


gccaagggct 


acggctgcaa 


gggcggccgg 


gagttccgcg 


ccttcgagt t 


catca tctcc 


660 


aacaagggcg 


tggcccccgg 


cgccat ccac 


ccgtacaagg 


ccgccaaggg 


cacctgcaag 


720 


accgacggcg 


tgccgaact c 


cgcctacatc 


accggctacg 


cccgcgtgcc 


gcgcaacaac 


780 


gagtcc tcca 


tga t gtacgc 


cgtgtccaag 


cagccgatca 


ccgtggccgt 


ggacgccaac 


840 


gccaac 1 1 cc 


agt actacaa 


gtccggcgtg 


t tcaacggcc 


cgtgcggcac 


ctccctcaac 


900 


cacqccgtga 


ccgcca tcgg 


ct acggccag 


gactccatca 


tctacccgaa 


gaagtggggc 


960 


gccaagtggg 


gcgsggccgg 


ct acatccgc 


acggcccgcg 


acgtgtcctc 


ctcctccggc 


1020 


a tctgccgca 


tcgcca tcga 


cccgccctac 


ccgaccctcg 


aggagtag 




1068 



<210> 79 

< 2 1 1 > 1497 

<212> DNA 

< 2 1 3> Trichoderma reesei 



<220> 

<221> CDS 

<222> 11).. (1497) 

<223> Trichoderma reesei ce 1 lobiohyrodlase I 

<400> 79 

atg cag teg gcg tot act etc caa teg gag act cac ccg cct ctg aca 48 
Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Lea Thr 
1 S 10 15 

tga cag aaa tgc teg tct ggt ggc acg tgc act caa cag aca ggc tec 96 
Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 ' 25 30 
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gtg gtc ate gac gec aac egg cgc tgg act cac get acg aac age age 
Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
35 40 45 

acg aac tgc tac gat ggc aac act tgg age teg ace eta tgt cct gac 
Thr Asn Cvs Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
50 * 55 60 

aac cag acc tgc gcg aag aac tgc tgt ctg gac ggt gcc gcc tac gcg 
Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
65 ^O 75 80 

tec acq tac gaa gtt acc acg age ggt aac age etc tec att ggc ttt 
ser Thv Tyr Glv Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 

' 35 90 95 

etc acc caq cct gcg cag aag aac gtt ggc get cgc ctt tac ctt atg 
Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

100 105 110 

gcg age gac acg acc tac cag gaa ttc acc ctg ctt ggc aac gag ttc 
Ala s-e>- A*p Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
115 120 125 

tct ttc gat gtt gat gtt teg cag ctg ccg tgc ggc ttg aac gga get 
Ser D he Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
130 135 140 

etc cac ttc gcg tec atg gac gcg gat ggt ggc gtg age aag tat ccc 
Leu Tv Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
145 1^0 155 160 

acc aac acc qct age gcc aag tac ggc acg ggg tac tgt gac age cag 
Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 

tgt ccc cgc gat ctg aag tec ate aat ggc cag gcc aac gtt gag ggc 
Cvs Fro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly 

160 185 190 

egg cag ccg tea ccc aac aac gcg aac acg ggc att gga gga cac gga 
Tro G=u Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
195 200 205 

age tec tgc tct gag atg gat ate tgg gag gcc aac tec ate tec gag 
S^r cvs Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
" 2i0 215 220 

■~cc c t acc ccc cac cct tgc acg act gtc ggc cag gag ate tgc gag 
Ai a L*u Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 
225 ~ 230 235 240 

agt oat ggg tgc ggc oga act tac tec gat aac aga tat ggc ggc act 
Gly Asp Glv Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 

rgr- gat ccc gat ggc tgc gac tgg aac cca tac cgc ctg ggc aac acc 
Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 
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age etc tac 

Ser Phe Tyr 
275 

ttg acc gtt 

Leu Thr Val 
290 

cat gtc cag 

Tvr Val Gin 
305 

agt tac tct 

Ser Tyr Ser 



gca gaa ttc 
Ala Glu Phe 



ttc aag aag 
Phe Lys Lys 
355 

qat gat tac 
Asp Asp Tyr 
37Q 

aac gag acc 
Asn Glu Thr 
385 

age tec ggt 
Ser Ser Gly 



gtc acc ttc 
Val Thr Phe 



cct age ggc 
Pro Ser Gly 
'3 35 

acc cgc cgc 
Thr Arg Arg 

cct cac tac 

Ser His Tyr 
465 

cgc gee age 

Cys Ala Ser 



260 

ggc cct ggc 
Gly Pro Gly 



gtc acc cag 
Val Thr Gin 



aat ggc gtc 

Asn Gly Val 
310 

ggc aac gag 

Gly Asn Glu 
325 

ggc gga tec 

Gly Gly Ser 
340 

get acc tct 

Ala Thr Ser 



tac gee aac 
Tyr Ala Asn 



tec tec aca 

Ser Ser Thr 
390 

gtc cct get 

Val Pro Ala 
405 

tec aac ate 

Ser Asn lie 
420 

ggc aac cct 

Gly Asn Pro 



cca gec act 
Pro A La Thr 



ggc cag tgc 
Gly Gin Cys 
470 

ggc aca act 
Gly Thr Thr 
465 



265 

tct age ttt 
Ser Ser Phe 
230 

ttc gag acg 
Phe Glu Thr 
295 

act ttc cag 
Thr Phe Gin 



etc aac gat 
Leu Asn Asp 



tct ttc tea 
Ser Phe Ser 
345 

ggc ggc atg 
Gly Gly Met 
360 

atg ctg tgg 
Met Leu Trp 
375 

ccc ggt gee 
Pro Gly Ala 



cag gtc gaa 
Gin Val Glu 



aag ttc gga 
Lys Phe Gly 
425 

ccc ggc gga 
Pro Gly Gly 
440 

acc act gga 
Thr Thr Gly 

455 

ggc ggt att 
Gly Gly He 



tgc cag gtc 
Cys Gin Val 



acc etc gat 
Thr Leu Asp 



teg ggt gee 
Ser Gly Ala 
300 

cag ccc aac 
Gin Pro Asn 
315 

gat tac tgc 
Asp Tyr Cys 
330 

gac aag ggc 
Asp Lys Gly 



gtt ctg' gtc 
Val Leu Val 



ctg gac tec 
Leu Asp Ser 
380 

gtg cgc gga 
Val Arg Gly 
395 

tct cag tct 
Ser Gin Ser 
410 

ccc att ggc 
Pro He Gly 



aac ccg cct 
Asn Pro Pro 



age tct ccc 
Ser Ser Pro 
460 

ggc tac age 
Gly Tyr Ser 
475 

ctg aac cct 
Leu Asn Pro 
490 



270 

acc acc aag 

Thr Thr Lys 
285 

ate aac cga 

lie Asn Arg 



gec gag ctt 
Ala Glu Leu 



aca get gag 
Thr Ala Glu 

'335 

ggc ctg act 
Gly Leu Thr 
350 

atg agt ctg 
Met Ser Leu 
365 

acc tac ccg 
Thr Tyr Pro 



age tgc tec 
Ser Cys Ser 



ccc aac gec 

Pro Asn Ala 
415 

age acc ggc 

Ser Thr Gly 
430 

ggc acc acc 

Gly Thr Thr 
445 

gga cct acc 

Gly Pro Thr 



ggc ccc acg 
Gly Pro Thr 



tac tac tct 
Tyr Tyr Ser 
495 



aaa 864 
Lys 



tac 912 
Tyr 



ggt 960 

Gly 

320 

gag 1008 
Glu 



cag 1056 
Gin 



tgg 1104 
Trp 



aca 1152 
Thr 



acc 1200 

Thr 

400 

aag 1248 
Lys 



aac 1296 
Asn 



acc 1344 
Thr 



cag 1392 
Gin 



gtc 1440 

Val 

480 

cag 1488 
Gin 
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cgc ctq caa 1497 

Cys Leu 



<210> 80 

<2!1> 498 

<2)2> FRT 

<2!3> Trichoderma reesei 

<400> 80 

Met Gin Ser Ala Cvs Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
1 10 15 



Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 



Vai Vai He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
35 40 45 



Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
50 * 55 60 



Asn Glu Thr Cys Ala Lys Asn Cys Cvs Leu Asp Gly Ala Ala Tyr Ala 
65 ' 70 75 80 



5e' Thr Tyr Gly Vai Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe 

9 5 90 95 



Vai Thr Gin Ser Ala Gin Lys Asn Vai Gly Ala Arg Leu Tyr Leu Met 

100 105 110 



h\a Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
115 120 125 



S»r Phe Asp Vai Aso Vai Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
i 30 135 14 0 



Leu Tvr Phe Vai Ser Met Asp Ala Asp Gly Gly Vai Ser Lys Tyr Pro 
145 ' 150 155 160 



Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 HO 175 



Cys Pro Ara Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Vai Glu Gly 

180 185 190 
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Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
195 200 205 

Ser Cys Cys S»r Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu 
210 215 220 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 
22 c 230 235 240 

Gly *sp G*y Cvs Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 
' * 245 250 255 

O'S Aso Pro Asp Gly Cys Asd Trp Asn Pro Tyr Arg Leu Gly Asn Thr 

260 * 265 270 



Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 

275 230 285 

Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr 

290 295 300 

Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 

305 310 315 320 

Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 

Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 

340 J 345 350 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 

355 360 365 

Asp Asp Tvr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 

370 375 380 

A*n G>u Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 

3 is " 390 395 400 

Ser <; e c Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 

405 4 10 4 15 
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val Thr Phe Ser Asn lie Lys Phe Gly Pro lie Gly Ser Thr Gly Asn 

420 425 430 

Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 440 ^45 

Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 
450 455 460 

Ser His Tyr Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val 
465 470 475 480 

Cvs A^a Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 
* " 485 490 495 



Cys Leu 



<r210> 31 

<2U> 1365 

< 2 ! 2 > DNA 

<2i3> Trichoderma reesei 



<220> 

<221> CDS 



<222> (1) . • (1365) 

<223> trichoderma reesei ce 1 lobiohydrolase II 
<400> SI 

atg gcg ccc cca gag gag egg caa get cgc tea age gtc tgg ggc caa 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 

i 5 10 ' 15 

tot ggc ggc caa aac tgg teg ggt ccg act tgc tgt get tec gga age 

Cvs Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

ar-a fgc gtc tac tec aac gac tat tac tec cag tgt ctt ccc ggc get 

~hr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 

35 40 45 

gca age tea aac teg tec acg cgc gec gcg teg acg act tea cga gta 

aia Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 

50 *3 60 

c~<- ccc aca aca tec egg teg age tec gcg acg cct cca cct ggt tct 

Se- Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 

65' 70 75 80 

acr act ace aga gta cct cca gtc gga teg gga acc get acg tat tea 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 



48 



96 



144 



192 



240 



288 
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85 90 95 

ggc aac cct tec gtt ggg gtc act cct tgg gec aat gca tat tac gec 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 

tct gaa gtt age age etc get att cct age ttg act gga gec atg gec 

Ser Giu Val Ser Ser Leu Ala Tie Pro Ser Leu Thr Gly Ala Met Ala 

115 120 125 

act cct gca gca get gtc gca aag gtt ccc tct ttt atg tgg eta gat 

Tnr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 

130 135 1^0 

act ctt gac aag acc cct etc atg gag caa acc ttg gec gac ate cgc 

Thr L^u Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 

]45 150 155 ' 160 

acc acc aac aag aat ggc ggt aac tat gec gga cag ttt gtg gtg tat 

'h*- Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 1"?5 

gac ttg ccg gat cgc qat tgc get gec etc gee teg aat ggc gaa tac 

^sp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

ISO 185 190 

tct att gec gat ggt ggc gtc gee aaa tat aag aac tat ate gac acc 

Ser lie Ala Aso Gly Gly Val Ala Lys Tyr Lys Asn Tyr lie Asp Thr 

195 * 200 205 

att cgt caa att gtc gtg gaa tat tec gat ate egg acc etc ctg gtt 

lie Arg Gin lie Val Val Glu Tyr Ser Asp lie Arg Thr Leu Leu Val 

210 215 220 

ate gag cct aac tct ctt gee aac ctg gtg acc aac etc ggt act cca 

lie Glu Pro Aso Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 

225 ' 230 235 240 

aag tgt gec aat get cag tea gee tac ctt gag tgc ate aac tac gee 

Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys lie Asn Tyr Ala 

245 250 255 

ate aca cag ctg aac ctt cca aat gtt gcg atg tat ttg gac get ggc 

Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 

260 265 270 

cat gca gga tgg ctt ggc tgg ccg gca aac caa gac ccg gee get cag 

His Ala Gly Trc Leu Gly Tro Pro Ala Asn Gin Asp Pro Ala Ala Gin 

275 ' 230 285 

eta ttt gca aat gtt tac aag aat gca teg tct ccg aga get ctt cgc 

L--u Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 

290 295 300 

qga ttg gca acc aat gtc gee aac tac aac ggg tgg aac att acc age 

Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 

305 310 315 320 
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ccc cca teg cac acg caa ggc aac get gtc tac aac gag aag ctg tac 
Pro P^o Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 

see cac get att gga cct ctt ctt gcc aac cac ggc tgg tec aac gcc 
Tie Hi* Ala lie Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 

340 345 350 

ttc tec ate act gat caa ggt cga teg gga aag cag cct ace gga cag 
Php Phe lie Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 

caa caa tga gga gac tgg tgc aat gtg ate ggc acc gga ttt ggt att 
Gin Gin Trp Gly Asp Trp Cys Asn Val lie Gly Thr Gly Phe Gly lie 
370 375 380 

cor cca tec gca aac act ggg gac teg ttg ctg gat teg ttt gtc tgg 
Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
335 390 395 400 

arc aag cca ggc gac gag tgt gac ggc acc age gac age agt gcg cca 
Oa) Lys Pro Glv Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 
' * 405 410 416 

cq^ ret gac tec cac tgt gcg etc cca gat gcc ttg caa ccg gcg cct 
Arq Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 * 425 430 

caa get ggt get tgg ttc caa gcc tac ttt gtg cag ctt etc aca aac 
Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 1 440 445 

gca aac cca teg ttc ctg tag 
Ala Asn Pro Ser Phe Leu 
450 



<210> 52 

<2U> 454 

<212> PRT 

<213> Trichoderma reesei 

<400> 82 

Met Va^ Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
1 5 10 15 

Cv« Gly Glv Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 
'20 2b 30 

Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 
35 40 45 

A^a Ser ^er Se" Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
50 *5 60 
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c er p f0 T hr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 

70 7 ^ 80 

~hr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 95 

Glv Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 HO 

c er ciu Val Ser Ser Leu Ala He Pro Ser Leu Thr Gly Ala Met Ala 
U5 120 125 

Thr A 1 a Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
130 135 1"0 

T h r Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 
MS 150 155 160 

Thr A'a Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 I 70 175 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 I 85 iyu 

*ar Tie Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr He Asp Thr 
* ,95 200 205 

He Arq Gin lie Val Val Glu Tyr Ser Asp lie Arg Thr Leu Leu Val 
210 215 220 

fie Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 



?=, 230 



L L J 



235 240 



Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys lie Asn Tyr Ala 

245 250 ^ D 

val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 

260 265 270 

His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 

275 280 285 
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Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala' Leu Arg 
290 295 300 



Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn lie Thr Ser 
305 310 315 320 



Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 



lie His Ala Tie Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 

340 345 350 



Phe Phe lie Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 



Gin Gin Trp Gly Aso Trp Cys Asn Val He Gly Thr Gly Phe Gly He 
370 375 380 



Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 



Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

4 05 00 415 



A 



rg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

4 20 4 25 4 30 



Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 440 445 



Ala Asn Pro Ser Phe Leu 
4 50 



<210> 63 

<?M> 1317 

< 2 1 2 > ON A 

< 2 1 3 > Trichoderma reesei 



<220> 

<221> CDS 

<222> (1) . . (1317) 

< 2 2 3 > Trichoderma reesei endoglucanase I 
<400> 83 

atg cag caa ccg gga acc age acc ccc gag gtc cat ccc aag ttg aca 48 
Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
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! 5 10 15 

acc cac aag tgc aca aag ccc ggg ggg tgc gtg gec cag gac acc teg 
Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 

etc gtc etc gac tag aac tac cgc tgg atg cac gac gca aac tac aac 
Vai Val Leu Aso Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 " 40 45 

teg tgc acc acc aac ggc ggc gtc aac acc acg etc tgc cct gac gag 
Ser Cvs Thr Val Asn Gly Gly Val Asn Thr Thr Led Cys Pro Asp Glu 
SO 55 60 

gco acc cgc ggc aag aac tgc ttc ate gag ggc gtc gac tac gee gec 
ha Thr Cvs Gly lys Asn Cys Phe lie Glu Gly Val Asp Tyr Ala Ala 
65 ' ^0 75 80 

i-cq ggc gec acg acc teg ggc age age etc acc atg aac cag tac atg 
Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 9° 95 

ccc age age tct ggc ggc tac age age gtc tct cct egg ctg tat etc 
°ro S»r Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 no 

ctg gac tec gac ggt gag cac gcg acg ccg aag ccc aac ggc cag gag 
l.eu Asp Ser Aso Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
1 1 5 ' 120 125 

-ta' aac ttc gac gtc gac etc tct get ctg ccg tgt gga gag aac ggc 
leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 

ccg etc tac ctg tec cag acg gac gag aac ggg ggc gec aac cag tat 
"r Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 

150 155 1^0 

aac acg gec got gec aac tac ggg age ggc tac tgc gat get cag tgc 
Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 1 7 0 175 

crc gtc cag aca teg agg aac ggc acc etc aac act age cac cag ggc 
P^o Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

160 165 190 

ttc tgc toe aac gag atg gat ate ctg gag ggc aac teg agg gcg aat 
Phe C«s Cys Asn Glu Met Aso lie Leu Glu Gly Asn Ser Arg Ala Asn 
' 195 200 205 

gee ttg acc cct cac tct tgc acg gee acg gec tgc gac tct gec ggt 
Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 2)5 220 



cn C acc tec aac ccc tat ggc age ggc tac aaa age tac tac ggc ccc 
Cvs Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 235 240 
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gaa gat acc gtt gac acc tec aag acc ttc acc ate ate acc cag ttc 
Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr lie lie Thr Gin Phe 

245 250 255 

aac acg aac aac ggc teg ccc teg ggc aac ctt gtg age ate acc cgc 
asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser lie Thr Arg 

260 265 270 

aao rac cag caa aac ggc gtc gac ate ccc age gee cag ccc ggc ggc 
Lvs Ty f Gin Gin Asn Glv Val Asp lie Pro Ser Ala Gin Pro Gly Gly 
275 ' 280 285 

gac acc acc icq ccc cgc ccg tec gee tea gee tac ggc ggc etc gee 
Asp Thr lie Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 

no- acg age aag gec ccg age age ggc atg gtg etc gtg ttc age att 
Thr K-t GW Lvs Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser lie 
305 ~ ~* ' 310 315 320 

tag aac gac aac age cag tac atg aac egg etc gac age ggc aac gee 
Trp A Sn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 335 

o-c ccc tgc age age acc gag ggc aac cca tec aac acc ctg gec aac 
Glv p-o Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr Leu Ala Asn 

340 345 350 

aac ccc aac acg cac gec gtc etc tec aac ate cgc tgg gga gac att 
A Sn Pro Asn Thr His Val Val Phe Ser Asn lie Arg Trp Gly Asp lie 
355 360 365 

qqq tct act acg aac teg act gcg ccc ccg ccc ccg cct gcg tec age 
Glv Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
' 370 375 380 

a<-q acq tct ccg act aca egg agg age teg acg act teg age age ccg 
Trv- Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
385 390 395 400 

aqc tgc acg cag act cac tgg ggg cag tgc ggt ggc att ggg tac age 
Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly lie Gly Tyr Ser 

405 410 415 



768 



qaq cgc aag aca cgc acg teg ggc act acq tgc cag tat age aac gac 

Glv Cvs Lys Thr Cvs Thr Ser Giy Thr Thr Cys Gin Tyr Ser Asn Asp 

420 425 430 

tac cac teg caa Cgc etc tag 

Tyr Tyr Ser Gin Cys Leu 
•135 



<210> 34 

<2il> 438 

< 2 1 2 > PRT 

< 2 1 3 > Trichoderma reesei 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1317 
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<400> 34 

Met Gin Gin Pro Glv Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
1 5 10 15 



Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 



Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 40 45 

S*r Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
50 55 60 

Va Thr Cys Gly Lys Asn Cys Phe lie Glu Gly Val Asp Tyr Ala Ala 
65 ^O 75 80 

Ser Gly Va 1 T h- Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

35 90 95 

Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 HO 

Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
115 120 125 

Leu *er Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 

Q er Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 160 

A-n Th- Ala Glv Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 
" ' 165 HO 175 

Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

180 185 190 

Ph° C»s Cys Asn Glu Met Asp lie Leu Glu Gly Asn Ser Arg Ala Asn 
1 195 200 205 

Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 220 
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Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 235 240 



Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr lie lie Thr Gin Phe 

245 2S0 255 



Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser He Thr Arg 

260 265 270 



Lys Tyr Gin Gin Asn Gly Val Asp lie Pro Ser Ala Gin Pro Gly Gly 
275 280 285 



Asp Thr lie Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 



Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser He 
305 310 315 320 



Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 335 



Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr Leu Ala Asn 

340 345 350 



Asn Pro Asn Thr His Val Val Phe Ser Asn He Arg Trp Gly Asp lie 
355 360 365 



Glv Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 



Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
385 390 395 400 



Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 



Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

420 425 430 



Tyr Tyr Ser Gin Cys Leu 
4 35 



<210> 35 
<2H> 954 
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<212> DNA 

<2}3> Artificial Sequence 
<220> 

<223> 6GP1 



<220> 

< 221 > CDS 

<222> (1 ) . . (954 ) 

<223> 6GP1 



<400> 85 

atg ggc gtg gac ccg ttc gag cgc aac aag ate etc ggc cgc ggc ate 

Met Gly val Asp Pro Phe Glu Arg Asn Lys lie Leu Gly Arg Gly lie 
1 5 10 15 



gto ate aag gac gag ttc ttc gac ate ate aag gag gee ggc ttc tec 

Val lie Lys Asp Glu Phe Phe Asp lie lie Lys Glu Ala Gly Phe Ser 
35 40 45 

cac gtg cgc ate ccg ate cgc tgg tec acc cac gee tac gee ttc ccg 

His Val Arg lie Pro lie Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
50 55 60 

ccg tac aag ate atg gac cgc ttc ttc aag cgc gtg gac gag gtg ate 

Pro Tyr Lys lie Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val lie 

65 ' 70 75 80 

aac ggc gee etc aag cgc ggc etc gee gtg gee ate aac ate cac cac 

Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Ala lie Asn lie His His 

85 90 95 

tac gag gag etc atg aac gac ccg gag gag cac aag gag cgc ttc etc 

Tvr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 110 

gee etc tgg aag cag ate gee gac cgc tac aag gac tac ccg gag acc 

Ala Leu Trp Lys Gin He Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 
115 120 125 

etc ttc ttc gag ate etc aac gag ccg cac ggc aac etc acc ccg gag 

Leu Phe Phe Glu lie Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
130 135 140 

aag tgg aac gag ctg etc gag gag gee etc aag gtg ate cgc tec ate 

Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val He Arg Ser He 

145 150 155 160 

gac aag aag cac acc ate ate att ggc acc gca gag tgg gga ggc ate 

Asp Lys Lys His Thr lie He lie Gly Thr Ala Glu Trp Gly Gly He 

165 WO 175 

tec gec etc gag aag etc tec gtg ccg aag tgg gag aag aat tec ate 



48 



aac ate ggc aac gec ctg gag gee ccg aac gag ggc gac tgg ggc gtg 96 
Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn- Ser He 

180 185 190 

gtg acc ate cac tac tac aac ccg ttc gag ttc acg cac cag ggc gec 624 

Val. Thr He His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 

195 200 205 

gag egg gtg gag ggc ccc gag aag egg etc ggc cgc aag tgg ggc tec 672 

Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 

210 215 220 

ccg cac gac cag aag cac ccc ate gag gag ttc aac ttc ate gag gag 720 

Pro Asp Asd Gin Lys His Leu He Glu Glu Phe Asn Phe lie Glu Glu 

225 * 230 235 240 

tgg ccc aag aag aac aag cgc ccg ate tac ate ggc gag ttt ggc gee 768 

Trp Ser Lvs Lys Asn Lys Arg Pro lie Tyr lie Gly Glu Phe Gly Ala 

245 250 255 



tac cgc aag gee gac ccc gag tee cgc ate aag tgg acc tec ttc gtg 

Tyr Arg Lys Ala Asp Leu Glu Ser Arg lie Lys Trp Thr Ser Phe Val 

260 265 270 

gtg cat gag atg gag aag cgc cgc tgg tec tgg gee tac tgg gag ttc 

Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 

275 280 285 



< 2 1 0 > 


36 


<2H> 


317 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


36 



Me*: Gly Val Asp Pro Phe Glu Arg Asn Lys lie Leu Gly Arg Gly lie 
1 5 10 15 



Asn lie Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 



Val He Lys Asp Glu Phe Phe Asp He He Lys Glu Ala Gly Phe Ser 
35 40 45 



816 



864 



tgc tec ggc ttc ggc gtg tac gac acc etc cgc aag acc tgg aac aag 912 

Cys Ser Glv Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 

290 " 295 300 

gac etc ccc gag gee ccc ate ggc ggc gac tec ate gag tag 954 

Asp Leu Leu Glu Ala Leu lie Gly Gly Asp Ser lie Glu 

305 310 315 
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Hjs Val Arg lie 
50 



Pro Tyr Lys lie 

65 



Asn Gly Ala Leu 



Tyr Glu Glu Leu 

100 



Ala Leu Trp Lvs 
115 



Leu Phe Phe Glu 
130 



Lys Trp Asn Glu 
M5 



Asp Lys Lys His 



Ser Ala Leu Glu 

180 



Val Thr lie His 
195 



Glu Trp Val Glu 
210 



Pro Asp Asp Gin 
225 



Trp Ser Lys Lys 



Tyr Arc Lys Ala 

2 60 



Val Arg Glu Met 
275 



Pro lie Arg Trp 
55 



Met Asp Arg Phe 
70 



Lys Arg Gly Leu 
85 



Met Asn Asp Pro 



Gin He Ala Asp 

120 



lie Leu Asn Glu 
135 



Leu Leu Glu Glu 
150 



Thr lie lie lie 
165 



Lys Leu Ser Val 



Tyr Tyr Asn Pro 

200 



Gly Ser Glu Lys 
215 



Lvs His Leu lie 
230 



Asn Lys Arg Pro 
24 5 



Asp Leu Glu Ser 



Glu Lys Arg Arg 

280 



Ser Thr His Ala 

60 



Phe Lys Arg Val 
75 



Ala val Ala lie 
90 



Glu Glu His Lys 
105 



Arg Tyr Lys Asp 



Pro His Gly Asn 

140 



Ala Leu Lys Val 
155 



Gly Thr Ala Glu 
170 



Pro Lys Trp Glu 
185 



Phe Glu Phe Thr 



Trp Leu Gly Arg 

220 



Glu Glu Phe Asn 
235 



lie Tyr He Gly 
250 



Arg He Lys Trp 
265 



Trp Ser Trp Ala 



Tyr Ala Phe Pro 



Asp Glu Val He 

80 



Asn lie His His 
95 



Glu Arg Phe Leu 
110 



Tyr Pro Glu Thr 
125 



Leu Thr Pro Glu 



He Arg Ser He 

160 



Trp Gly Gly He 
175 



Lys Asn Ser lie 
190 



His Gin Gly Ala 
205 



Lys Trp Gly Ser 



Phe He Glu Glu 

240 



Glu Phe Gly Ala 
255 



Thr Ser Phe Val 
270 



Tyr Trp Glu Phe 
285 
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Cys Ser Gly Phe Gly Vai Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
290 295 300 



Asp Leu Leu Glu Ala Leu lie Gly Giy Asp Ser lie Glu 
30b 310 315 



<210> 3 7 

<211> 1248 

<212> ONA 

< 2 1 3 > Mordeum vulgare 



<220> 

<221> CDS 

<222> (1) . . (1248) 

<223> Barley Amy I amylase 

<400> 37 

atg gca cac caa gtc etc ttt cag ggg ttc aac tgg gag teg tgg aag 
Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 

cag age ggc ggg tgg tac aac atg atg atg ggc aag gtc gac gac ate 
Gin Ser Giy Gly Tro Tvr Asn Met Met Met Gly Lys Val Asp Asp lie 

20 25 30 

qcc get gec cga gtc acc cac gtc tgg ctg cca ccg ccg teg cac tec 
Ala Ala Ala Gly Vai Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
35 40 45 

gtc tec aac gaa ggt tac atg cct ggt egg ctg tac gac ate gac gcg 
Val Ser Asn Glu Glv Tyr Met Pro Gly Arg Leu Tyr Asp He Asp Ala 
50 ' 55 60 

tec aag tac ggc aac gcg gcg gag etc aag teg etc ate ggc gcg etc 
Ser Lvs Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu lie Gly Ala Leu 
65 70 75 80 



ggc ace tec gac ggc cgc etc gac tgg ggc ccc cac atg ate tgt cgc 

Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met He Cys Arg 

115 120 125 

gac gac acc aaa tac tec gat ggc acc gca aac etc gac acc gga gee 

Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 

130 135 140 



48 



96 



144 



192 



240 



cac ggc aag ggc gtg cag gee ate gee gac ate gtc ate aac cac cgc 288 

His Gly Lys Gly Val Gin Ala lie Ala Asp lie Vai lie Asn His Arg 

85 90 95 

tgc gee gac tac aag gat age cgc ggc ate tac tgc ate ttc gag ggc 336 

Cys Ala Asp Tyr Lys Asp Ser Arg Gly lie Tyr Cys He Phe Glu Gly 

100 105 110 



384 



432 
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gac ttc ccc gcc gcg ccc gac ate gac cac etc aac gac egg gtc cag 480 
Asp Phe Ala Ala Ala Pro Asp He Asp His Leu Asn Asp Arg Val Gin 
MS 150 155 160 

egc gag etc aag gag tgg etc etc tgg etc aag age gac etc ggc ttc 528 
Arg Glu Leu Lys Glu Trp Leu Leu Trp Leu Lys Ser Asp Leu Gly Phe 

165 HO 175 

gac gcg tgg cgc ctt gac ttc gcc agg ggc tac teg ccg gag atg gcc 576 
Asp Ala Trp Arg Leu Asp Phe Ala Arg Gly Tyr Ser Pro Glu Met Ala 

160 185 190 

aag gtg tac ate gac ggc aca tec ccg age etc gcc gtg gcc gag gtg 624 
Lys Val Tyr He Asp Gly Thr Ser Pro Ser Leu Ala Val Ala Glu Val 
195 200 205 

tgg gac aat atg gcc acc ggc ggc gac ggc aag ccc aac tac gac cag 672 
Trp Asp Asn Met Ala Thr Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin 
210 215 220 

gac gcg cac egg cag aat ctg gtg aac tgg gtg gac aag gtg ggc ggc 720 
Aso Ala His Arg Gin Asn Leu Val Asn Trp Val Asp Lys Val Gly Gly 
225 230 235 240 

gcg gcc teg oca ggc atg gtg ttc gac ttc acg acc aaa ggg ata ctg 768 
Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 



245 250 25 



aac get gcc gtg gag ggc gag ctg tgg agg ctg ate gac ccg cag ggg 816 
Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu lie Asp Pro Gin Gly 

260 265 270 

aag ccc ccc ggc gtg atg gga tgg tgg ccg gcc aag gcc gtc acc ttc 864 
Lvs Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 

gtc gac aac cac gat aca ggc tec acg cag gcc atg tgg cca ttc ccc 912 
Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

tec gac aag gtc atg cag ggc tac gcg tac ate etc acc cac ccc ggc 960 
Ser Aso Lys Val Met Gin Gly Tyr Ala Tyr He Leu Thr His Pro Gly 
305 310 315 320 

acc cca cgc ate ttc tac gac cat ttc ttc aac tgg ggg ttt aag gac 1008 
He Pro Cys He Phe Tvr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

325 " 330 335 

cag ate gcg gcg ctg gtg gcg ate agg aag cgc aac ggc ate acg gcg 1056 
Gin Tie Ala Ala Leu Val Ala He Arg Lys Arg Asn Gly lie Thr Ala 

340 345 350 

acg age get ctg aag ate etc atg cac gaa gga gat gcc tac gtc gcc 1104 
Thr Ser Ala Leu Lys He Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 

gag ata gac ggc aag gtg gtg gtg aag ate ggg tec agg tac gac gtc 1152 
Glu He Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
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370 375 380 

ggg gcg q ccj ate ccg gec ggg tec gtq acc teg gca cac ggc aac gac 1200 

Gi / Ala Val lie Pro Ala Gly Phe Va L Thr Ser Ala His Gly Asn Asp 

335 390 395 400 

tac gec gtc tgg gag aag aac ggc gec gcg gca aca eta caa egg age 1248 

Tvr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

4 05 4 10 4 15 



<210> 88 
<2l\> 416 
<212> PRT 

< 2 1 3> Hcrdeum vulgare 

<400> S3 

Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 



Gin Ser Giy Gly Tro Tvr Asn Met Met Met Gly Lys Val Asp Asp lie 

20 25 30 



Ala Ala Ala Gly Val Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
35 4 0 4 5 



Val Ser Asn Glu Gly Tvr Met Pro Gly Arg Leu Tyr Asp lie Asp Ala 
50 55 60 



Ser Lys Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu lie Gly Ala Leu 
65 70 75 80 



His Glv Lys Gly Val Gin Ala lie Ala Asp He Val lie Asn His Arg 

85 90 95 



Cvs Ala Asp Tyr Lys Asp Ser Arg Gly He Tyr Cys He Phe Glu Gly 

100 ' 105 HO 



Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met He Cys Arg 
115 120 125 



Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 
130 135 140 



Asp Phe Ala Ala Ala Pro Asp He Asp His Leu Asn Asp Arg Val Gin 
145 150 155 160 
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Arg Glu Leu Lys 



Asp Ala Trp Arg 

180 



Lys Val Tyr lie 
195 



Trp Asp Asn Met 
210 



Asp Ala His Arg 
225 



Ala Ala Ser Ala 



Asn Ala Ala Val 

260 



Lys Ala Pro Gly 
27 5 



Val Asd Asn His 
2 90 



Ser Asp Lys Val 
305 



lie Pro Cys lie 



Gin lie Ala Ala 

340 



Thr Ser Ala Leu 
355 



Glu He Asp Gly 
370 



Gly Ala Val lie 
385 



Glu Trp Leu Leu 
165 



Leu Asp Phe Ala 



Asp Gly Thr Ser 

200 



Ala Thr Glv Gly 
215 



Gin Asn Leu Val 
230 



Gly Met Val Phe 
2 4 5 



Glu Gly Glu Leu 



Val Met Gly Trp 

280 



Asp Thr Gly Ser 
295 



Mec Gin Glv Tyr 
310 



Phe Tyr Asp His 
325 



Leu Val Ala lie 



Lys lie Leu Met 

360 



Lys Val Val Val 
375 



Pro Ala Gly Phe 
390 



Trp Leu Lys Ser 
170 



Arg Gly Tyr Ser 
185 



Pro Ser Leu Ala 



Asp Gly Lys Pro 

220 



Asn Trp Val Asp 
235 



Asp Phe Thr Thr 
250 



Trp Arg Leu lie 
265 



Trp Pro Ala Lys 



Thr Gin Ala Met 

300 



Ala Tyr lie Leu 
315 



Phe Phe Asn Trp 
330 



Arg Lys Arg Asn 
345 



His Glu Gly Asp 



Lys lie Gly Ser 

380 



Val Thr Ser Ala 
395 



Asp Leu Gly Phe 
175 



Pro Glu Met Ala 
190 



Val Ala Glu Val 
205 



Asn Tyr Asp Gin 



Lys Val Gly Gly 

240 



Lys Gly He Leu 
255 



Asp Pro Gin Gly 
270 



Ala Val Thr Phe 
285 



Trp Pro Phe Pro 



Thr His Pro Gly 

320 



Gly Phe Lys Asp 
335 



Gly He Thr Ala 
350 



Ala Tyr Val Ala 
365 



Arg Tyr Asp Val 



His Gly Asn Asp 

400 
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Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

405 410 415 



<210> 89 

<2 1 1 > 1401 

<212> DNA 

< 2 1 3 > Artificial Sequence 
<220> 

<223> Trichoderma reesei ft-Glucosidase 2 



<220> 

<22!> CDS 

<222> (1) . . (1401) 

<223> Trichoderma reesei fr-Glucosidase 2 

<400> 89 

acg ctg ccc aag gac ttt cag tgg ggg ttc gcc acg get gcc tac cag 48 
Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 

ate gag ggc gcc gcc gac cag gac ggc cgc ggc ccc age ate tgg gac 96 
lie Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser lie Trp Asp 

20 25 30 

acg ttc cgc gcg cag ccc ggc aag ate gcc gac ggc teg teg ggc gtg 144 
Thr Phe Cys Ala Gin Pro Gly Lys lie Ala Asp Gly Ser Ser Gly Val 
35 40 45 

acg gcg tgc gac teg tac aac cgc acg gcc gag gac att gcg ctg ccg 192 
Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp lie Ala Leu Leu 
50 55 60 

aag teg etc ggg gcc aag age tac cgc ttc tec ate teg tgg teg cgc 240 
Lvs Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser lie Ser Trp Ser Arg 
65 70 75 80 

acc acc ccc gag ggc ggc cgc ggc gat gcc gtc aac cag gcg ggc ate 288 
lie lie Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly He 

85 90 95 

gac cac tac gcc aag tec gcc gac gac ctg etc gac gcc ggc ate acg 336 
Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly lie Thr 

100 105 110 

ccc tec acc acc etc tec cac tgg gac ctg ccc gag ggc ctg cac cag 384 
Pro Phe lie Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 

egg cac ggg ggg ccg ctg aac cgc acc gag tec ccg etc gac ttt gaa 432 
Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 

aac tac ccc cgc gcc atg ttc agg gcg ctg ccc aag gtg cgc aac tgg 480 



Page 98 



WO 2005/096804 



PCTAJS2004/007182 



Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 

145 150 155 ~ 160 

arc acc tec aac gag ccg ctg tgc teg gec ate ccg ggc tac ggc tec 528 

lie Thr Phe Asn Glu Pro Leu Cys Ser Ala lie Pro Gly Tyr Gly Ser 

165 170 175 

ggc acc ttc gec ccc ggc egg cag age acc teg gag ccg tgg acc gtc 576 

Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 



ggc cac aac ate etc gtc gec cac ggc cgc gee gtc aag gcg tac cgc 
Gly His Asn lie Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 



gac egg ctg ccg acc ttt acg ccc gag gag cgc gee etc gtc cac ggc 
Asp Arg Leu Pro Thr Phe Thr Pro Glu Glu Arg Ala Leu Val His Gly 
275 280 285 



cac cgc age teg ccc gec tec gee gac gac acc gtc ggc aac gtc gac 
His Arq Ser Ser Pro Ala Ser Ala Asp Asp Thr Val Gly Asn Val Asp 
305 310 315 320 



624 



gac gac ttc aag ccc gee age ggc gac ggc cag ate ggc ate gtc etc 672 

Asp Asp Phe Lys Pro Ala Ser Gly Asp Gly Gin lie Gly He .Val Leu 

210 215 220 

aac ggc gac ttc acc tac ccc tgg gac gee gee gac ccg gec gac aag 720 

Asn Gly Asp Phe Thr Tyr Pro Trp Asp Ala Ala Asp Pro Ala Asp Lys 

225 230 235 240 

gag gcg gee gag egg cgc etc gag ttc ttc acg gee tgg ttc gcg gac 768 

Glu Ala Ala Glu Arg Arg Leu Glu Phe Phe Thr Ala Trp Phe Ala Asp 

245 250 255 

ccc ate tac ttg ggc gac tac ccg gcg teg atg cgc aag cag ctg ggc 816 

Pro lie Tyr leu Glv Asp Tyr Pro Ala Ser Met Arg Lys Gin Leu Gly 

260 265 270 



864 



tec aac gac ttt tac ggc atg aac cac tac acg tec aac tac ate cgc 912 
Ser Asn Asp Phe Tyr Gly Met Asn His Tyr Thr Ser Asn Tyr He Arg 
290 295 300 



960 



gcg etc ttc acc aac aag cag ggc aac tgc ate ggc ccc gag acg cag 1008 

Val Leu Phe Thr Asn Lys Gin Gly Asn Cys He Gly Pro Glu Thr Gin 

325 330 335 

tec ccc tgg ctg cgc ccc tgt gee gee ggc ttc cgc gac ttc ctg gtg 1056 

Ser Pro Trp Leu Arg Pro Cys Ala Ala Gly Phe Arg Asp Phe Leu Val 

340 345 350 

tgg ate age aag agg tac ggc tac ccg ccc ate tac gtg acg gag aac 1104 

Trp lie Ser Lys Arg Tyr Gly Tyr Pro Pro lie Tyr Val Thr Glu Asn 
355 360 365 

ggc acg age ate aag ggc gag age gac ttg ccc aag gag aag att etc 1152 

Gly Thr Ser lie Lys Gly Glu Ser Asp Leu Pro Lys Glu Lys He Leu 

370 375 380 
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gaa gat gac etc agg gtc aag tac tat aac gag tac ate cgt gec atg 1200 

Glu Asp As d Phe Arg Val Lys Tyr Tyr Asn Glu Tyr lie Arg Ala Met 

385 390 395 400 

gtt acc gec gtg gag ctg gac ggg gtc aac gtc aag ggg tac ttt gec 1248 

Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 

tgg teg etc atg gac aac ttt gag tgg gcg gac ggc tac gtg acg agg 1296 

Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 

etc ggg gtt acg tat gtg gat tat gag aat ggg cag aag egg ttc ccc 1344 

Pne Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
4 35 4 40 44 5 

aag aag age gca aag acc ttg aag ccg ctg ttt gac gag ctg att gcg 1392 

Lys Lvs Ser Ala Lvs Ser Leu Lys Pro Leu Phe Asp Glu Leu He Ala 
4 50 " 4 55 4 60 



gcg gcg tga 
Ala Ala 
465 



< 2 1 0 > 90 

< 2 1 1 > '166 

< 2 1 2 > PRT 

< 2 1 3 > Artificial Sequence 
<220> 

< 2 2 3> Synthetic Construct 

<400> 90 



Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 



Tip Glu Giv Ala Val Asp Gin Asp Gly Arg Gly Pro Ser He Trp Asp 

20 25 30 



Thr Phe Cys Ala Gin Pro Gly Lys lie Ala Asp Gly Ser Ser Gly Val 



0 45 



Thr Ala Cvs Asp Ser Tyr Asn Arg Thr Ala Glu Asp lie Ala Leu Leu 
50 ' 55 60 



Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser lie Ser Trp Ser Arg 
65 70 75 80 



lie lie Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly He 

85 90 95 



1401 
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Asd His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly lie Thr 

100 105 110 



Pre Phe He Thr Leu Phe His Tr d Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 



Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 ' 135 140 



Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
H5 J 150 155 160 



lie Thr Phe Asn Glu Pro Leu Cys Ser Ala lie Pro Gly Tyr Gly Ser 

165 HO 175 



Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 



Gly His Asn lie Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 



Asp Asp Phe Lys Pro Ala Ser Glv Asp Gly Gin lie Gly He Val Leu 
210 215 220 



Asn Glv Asp Phe Thr Tyr Pro Trp Asp Ala Ala Asp Pro Ala Asp Lys 

230 235 240 

w 



Glu Ala Ala Glu Arg Arg Leu Glu Phe Phe Thr Ala Trp Phe Ala Asp 

245 250 255 



?-r. j\e Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Lys Gin Leu Gly 

260 ' 265 270 



Asp Arg Leu Pro Thr Phe Thr Pro Glu Glu Arg Ala Leu Val His Gly 
275 280 285 



Ser Asn Asp Phe Tyr Gly Met Asn His Tyr Thr Ser Asn Tyr lie Arg 
290 * ' 295 300 



His Arg Ser Ser Pro Ala Ser Ala Asp Asp Thr Val Gly Asn Val Asp 
305 " 310 315 320 
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Val Leu Phe Thr Asn Lys Gin Gly Asn Cys lie Gly Pro Glu Thr Gin 

325 330 335 



Ser Pro Trp Leu Arg Pro Cys Ala Ala Gly PheArg Asp Phe Leu Val 

340 345 350 



Trp lie Ser Lys Arg Tyr Gly Tyr Pro Pro He Tyr Val Thr Glu Asn 
355 ' 360 365 



Gly Thr Ser He Lys Gly Glu Ser Asp Leu Pro Lys Glu Lys He Leu 
370 " 375 380 



Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr lie Arg Ala Met 
365 390 395 400 



Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 



Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

S20 425 430 



Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
435 440 445 



Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu He Ala 
450 455 460 



A la Ala 
4 65 



<2\0> 91 

<231> 2103 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei ft-Glucos idase D 



<220> 

<22i> CDS 

<222> {If . . (2103) 

<223> Trichoderma reesei ft-Glucosidase D 
<400> 91 

atg att etc ggc tgt gaa age aca ggt gtc ate tct gee gtc aaa cac 
Met He Leu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
15 10 15 



48 
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ccc gcc gcc aac gac cag gag cac gag egg cga gcg gtc gac tgt etc 
Phe Val Ala Asn Aso Gin Giu His Glu Arg Arg Ala Val Asp- Cys Leu 

20 25 30 



gta gcc cga gac gca agg ccc ggc gca ttg atg aca tec tac aac aag 

Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 

50 55 60 

gcc aat ggc aag cac gcc get gac age gcc gag ttc ctt cag ggc att 

Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly lie 

65 "70 75 80 

etc cag act gag tgg aat tgg gac cct etc att gtc age gac tgg tac 

Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu lie Val Ser Asp Trp Tyr 

85 90 95 

ggc acc cac acc act act gat gcc ate aaa gcc ggc ctt gat etc gag 

Gly Thr Tyr Thr Thr lie Asp Ala He Lys Ala Gly Leu Asp Leu Glu 

100 105 110 



cag gcc cgt ttg ctg aag cag tec act ate gat gag cgc get cgc cgc 

Gin Ala Arg Leu Leu Lys Gin Ser Thr lie Asp Glu Arg Ala Arg Arg 

130 135 HO 

gcg ccc agg etc gcc cag aag gcc age cat etc aag gtc tec gag gta 

Va 1 Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 

145 150 155 160 

cag caa ggc cgt gac ccc cca gag gat cgc gtc etc aac cgt cag ate 

Giu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin lie 

165 170 175 

tgc ggc age age att gtc eta ctg aag aat gag aac tec ate tta cct 

Cys Gly Ser Ser lie Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 

180 185 190 

ccc ccc aag ccc gcc aag aag gtc gcc ctt gtt ggt tec cac gtg cgt 

Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 

195 200 205 

eta ccg cct acc teg gga gga ggc age gcc tct ctt gtc cct tac tat 

Leu Pro Ala He Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 

210 215 220 

gcc aca ccc cca cac gac gcc gcc tec gag gta eta gcc ggt gcc acg 

Ala lie Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 

225 230 235 240 

ate acg cac gag gcc ggt gcc tat gcc cac caa atg ctg ccc gtc ate 



96 



acc acc cag egg gcc etc egg gag gtc tat ctg cga ccc ttc cag ate 144 
lie Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin lie 
35 40 45 



192 



240 



288 



336 



atg ccg ggc gcc tea cga cat cgc ggc aaa tac ate gag tct get ctg 384 
Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr lie Glu Ser Ala Leu 
1 i 5 120 125 



432 



480 



528 



576 



624 



672 



720 



768 
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lie Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leo Pro Val He 

245 250 255 

sac aca atg ate age aac gee gta ate cac ttc tac aac gac ccc ate 
Asp Ala Met lie Ser Asn Ala Val lie His Phe Tyr Asn Asp Pro lie 

260 265 270 

gat gtc aaa gac aga aag etc ctt ggc agt gag aac gta teg teg aca 
Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 

ccq etc cag etc atg gat tac aac aac ate cca acg etc aac aag gee 
5er Phe Gin Leu Met Asp Tyr Asn Asn lie Pro Thr Leu Asn Lys Ala 

290 295 300 

at-o ttc tgg ggc act etc gtg ggc gag ttt ate cct ace gee acg gga 
Met Phe Trp Gly Thr Leu Val Gly Glu Phe lie Pro Thr Ala Thr Gly 
305 310 315 320 

act egg gaa ttc ggc etc agt gtc ttt ggc act gee gac ctt tat att 
lie Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 

gat aac gag etc gtg acc gaa aat aca aca cat cag acg cgt gga ace 
Asp Asn Glu Leu Val lie Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

acc ttt ttc gga aag gga acg acg gaa aaa gtc get acc agg agg atg 
»la D he Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 360 365 



acg acc aag atg gac acg acc ggc get gtc aac ttt ggc ggc ggt gee 

^hr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 

385 ' 390 395 400 

g^a cac ccg ggt gee cgt etc aag gec gac cca cag gag atg att gcg 

Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met lie Ala 

405 410 415 

cag gec gtc aag gee gca gec gat gec gac tac acc ate ate tgc acg 

Arq Ma Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr lie lie Cys Thr 

,120 425 4 30 

gca etc age gec gag tgg gag tct gag ggt ttt gac egg cct cac atg 

Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 

435 4 40 4 45 

gac ccg ccc cct ggt gcg gac acc acg ate teg caa gtt ctt gac gec 

Asp Leu Pro Pro Gly Val Asp Thr Mec He Ser Gin Val Leu Asp Ala 

450 455 460 

acc ccc aac get gta gec gtc aac cag tea ggc acc cca gtg aca atg 

£i a p ro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 

465 * 4 7 0 475 480 



816 



864 



912 



960 



1008 



1056 



1104 



gtg gec ggc age acc tac aag ctg cgt etc gag ttt ggg tct gee aac 1152 
v; a i /*i a Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 
370 375 380 



1200 



1248 



1296 



1344 



1392 



1440 
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age cog get cac aaa gca aag gec act gtg cag get tgg tat ggt ggt 1488 
Ser Trp Ala His Lys Ala Lys Ala He Val Gin Ala Trp Tyr Gly Gly 

485 490 495 



aae gag aca ggc cac gga ate tec gat gtg etc ttt ggc aac gtc aac 

Asn Glu Thr Gly His Gly He Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 

ccg teg ggg aaa etc tec eta teg tgg cca gtc gat gtg aag cac aac 

Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 

515 520 525 

cca gca tat etc aac tac gee age gtt ggt gga egg gtc ttg tat ggc 

Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 

530 535 540 

gag gat act Cac gtt ogc tac aag etc tac gac aaa acg gag agg gag 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 

54* 550 555 560 



cca gat tct ace atg agg acg gtc ccc gaa acc ttc cac ccg gac cag 

Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 

ccc aca gta gee att gtc aag ate aag aac acg age agt gtc ccg ggc 

Pro Thr Val Ala lie Val Lys lie Lys Asn Thr Ser Ser Val Pro Gly 

595 600 605 

g<-c cag etc ccg cag tta tac att teg gee cca aac teg cct aca cat 

A>a Gin val Leu Gin Leu Tyr lie Ser Ala Pro Asn Ser Pro Thr His 

610 615 620 

cgc ccg gtc aag gag ctg cac gga ttc gaa aag gtg tat ctt gaa get 

Arq Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 

625 630 635 640 

ggc aag gag aag gag gta caa ata ccc att gac cag tac get act age 

Gly Glu Glu Lys Glu Val Gin lie Pro He Asp Gin Tyr Ala Thr Ser 

64 5 650 655 

etc teg qac gag act gag age atg tgg aag age gag agg ggc att tat 

Phe Tco Asp Glu lie Glu Ser Met Trp Lys Ser Glu Arg Gly lie Tyr 

660 665 670 

gat gtg etc gca gga etc ccg agt cag gaa ate teg ggc aag ggg aag 

Asp Val Leu Val Gly Phe Ser Ser Gin Glu lie Ser Gly Lys Gly Lys 

675 680 685 



1536 



1584 



1632 



1680 



gtt ccg ctt cct ttt ggg cat ggc ctg tct tac get acc ttc aag etc 1728 
Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



eta att gtg cct gaa acg cga tec tgg acg ggg ctg tag 2103 
Leu He Val Pro Glu Thr Arg Phe Trp Met Gly Leu 
690 695 700 
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<210> 92 

<2\\> 700 

< 2 i 2 > PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 92 

M*t He L*u Gly Cys Glu Ser Thr Gly Val lie Ser Ala Val Lys His 
1 5 10 15 

Phe v a l Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 

20 2 5 30 

Tie Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin lie 
35 4 0 4 5 

Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

"al Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly lie 
*s 70 ^5 80 

Leu Arq Thr Glu Trp Asn Trp Asp Pro Leu lie Val Ser Asp Trp Tyr 

65 90 95 

Glv ^hr Tyr Thr Thr lie Asp Ala lie Lys Ala Gly Leu Asp Leu Glu 
' ' 100 105 HO 



Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr lie Glu Ser Ala Leu 
H5 120 125 

G'n A>a Arg Leu Leu Lvs Gin Ser Thr lie Asp Glu Arg Ala Arg Arg 
130 135 140 

Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
US 150 155 160 

Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin He 

165 170 175 

rys Gly Ser Ser lie Val Leu Leu Lys Asn Glu Asn Ser lie Leu Pro 

180 185 190 
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Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
195 * 200 205 



Leu Pro Ala lie Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 

210 215 220 

Ma Ser Leu Tyr Asd Ala Val Ser Glu Val Leu Ala Gly Ala Thr 

22b 230 235 240 

Tie Thr His Glu Val Gly Ala Tvr Ala His Gin Met Leu Pro Val lie 

245 250 255 



Asp Ala Met lie Ser Asn Ala Val lie His Phe Tyr Asn Asp Pro He 

260 265 270 



Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 



Q P r pho Gin Leu Met Asp Tyr Asn Asn He Pro Thr Leu Asn Lys Ala 
290 295 300 

Met Phe Trp Gly Thr Leu Val Gly Glu Phe lie Pro Thr Ala Thr Gly 
305 310 315 320 

He T rp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 



Asp Asn Glu Leu Val He Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 



Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 360 365 



Vai Ala Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 
370 ' 375 380 



Th*- Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 
385 390 395 400 

Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met He Ala 

4 05 410 415 



A^q Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 4 25 4 30 
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Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 

435 440 445 

Asp Leu Pro Pro Gly Val Asp Thr Met lie Ser Gin Val Leu Asp Ala 
4 50 4 55 4 60 



Ai a Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 

465 470 475 480 

Ser Trp Ala His Lys Ala Lys Ala lie Val Gin Ala Trp Tyr Gly Gly 

485 ' 490 495 



Asn Glu Thr Gly His Gly lie Ser Asp Val Leu Phe Gly Asn Val Asn 

500 ^05 510 



c-o Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 
515 520 525 

Pro nia Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 
530 535 540 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 550 555 560 

Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 " 585 590 



Pro Thr Val Ala lie Val Lys lie Lys Asn Thr Ser Ser Val Pro Gly 
595 600 605 



Aia Gin Val Leu Gin Leu Tyr lie Ser Ala Pro Asn Ser Pro Thr His 

610 615 620 

*rq Pro Val Lvs Giu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 

&25 * 630 635 640 

r,iu g»u Glu Lys Giu Val Gin lie Pro lie Asp Gin Tyr Ala Thr Ser 

645 650 655 
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Phe Trp Asp Glu lie Glu Ser Met Trp Lys Ser Glu Arg Gly-Ile Tyr 

660 665 670 



Asp Val Leu Val Gly Phe Ser Ser Gin Glu lie Ser Gly Lys Gly Lys 
675 680 685 



Leu lie Vol Pro Glu Thr Arg Phe Trp Met Gly Leu 
690 695 700 

<210> 93 

<2l\> 14 96 

< 21 2> DNA 

< 2 1 2 > Artificial Sequence 
<220> 



<22 3> Maiz 


e optimized 


1 CBHI 










<400> 93 
tgcag t ccgc 


oigcaccctc 






actcacctaa 


caaaaq t act 


60 


cctccggcgg 


cacct gcacc 


^a gca g a ccg 






aactaacact 


120 


ggaeccacgc 


caccaactcc 


C CCa CLddCt 


n \~ ^ c\ r\ 


^aacafT't an 

L (J u L u u u ^- y y 


t cctccaccc 


180 


t ctgcccgga 


caacgagacc 


cgcgccaaga 






acct acacct 


240 


ccacccacgg 


cgtgaccacc 


t ccggcaact 


CCClCt CCa L 






300 

^/ V 


cccagaagaa 


cgtgggcgcc 


cgcct ctacc 


tcatggcctc 


cgacaccacc 


taccaggagt 


3 60 


t caccctcct 


cggcaacgag 


ttctccttcg 


acgtggacgt 


gtcccagctc 


ccgtgcggcc 


420 


tcaacggcgc 


cc tctacttc 


gtgtccatgg 


acgccgacgg 


cggcgtgtcc 


aagtacccga 


480 


ccaacaccgc 


cggcgccaag 


tacggcaccg 


gctactgcga 


ctcccagtgc 


ccgcgcgacc 


540 


tcaagttcat 


caacggccag 


gccaacgtgg 


agggctggga 


gccgtcctcc 


aacaacgcca 


600 


acaccggca t 


cggcggccac 


ggct cctgct 


gctccgagat 


ggaca tctgg 


gaggccaact 


660 


ccatctccga 


ggccctcacc 


ccgcacccgt 


gcaccaccgt 


gggccaggag 


atctgcgagg 


720 


gcgacggctg 


cggcggcacc 


tactccgaca 


accgctacgg 


cggcacctgc 


gacccggacg 


780 


gctgcgactg 


gaacccgt ac 


cgcctcggca 


acacctcct t 


ctacggcccg 


ggctcct cct 


840 


ucaccctcqa 


caccaccaag 


aagctcaccg 


t ggtgaccca 


gt tcgagacc 


tccggcgcca 


900 


tcaaccgcta 


ctacgtgcag 


aacggcgtga 


cct tccagca 


gccgaacgcc 


gagctcggct 


960 


cccact ccgg 


caacgagctc 


aacgacgact 


actgcaccgc 


cgaggaggcc 


gagt tcggcg 


1020 


gctcctcct t 


ctccgacaag 


ggcggcctca 


cccagt tcaa 


gaaggccacc 


tccggcggca 


1080 


tggtgcccgt 


ga tgtccctc 


cgggacgact 


actacgccaa 


catgctctgg 


ctcgactcca 


1140 
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cc tacccodc 


caacqaqa cc 


ccccccaccc 


cgggcgccgt 


gcgcggctcc 


tgctccacct 


1200 


cctccggcct 


gccggcccag 


gtggagtccc 


agtccccgaa 


cgccaaggtg 


accttctcca 


1260 


aca tcaagt c 


cggcccga tc 


ggctccaccg 


gcaacccgtc 


cggcggcaac 


ccgccgggcg 


1320 


gcaacccgcc 


gggcaccacc 


accacccgcc 


gcccggccac 


caccaccggc 


tcctccccgg 


1380 


gccegaccca 


gtcccactac 


ggccagtgcg 


gcggcatcgg 


ctactccggc 


ccgaccgtgt 


1440 


gcgcctccgg 


caccacctgc 


caggtgctca 


acccgtacta 


ctcccagtgc 


ctctag 


1496 



<210> 94 

< 2 1 1 > 1365 
< 2 1 2 > DNA 

< 2 1 3 > Arcificial Sequence 
<220> 

<22 3> Maize optimized CBHII 
<400> 94 

atggtgccgc tcgaggagcg ccaggcctgc tcctccgtgt ggggccagtg cggcggccag 60 

aactggtccg gcccgacctg ctgcgcctcc ggctccacct gcgtgtactc caacgactac 120 

cactcccagc gcctcccggg cgccgcctcc tcctcctcct ccacccgcgc cgcctccacc 

accccccgcg tgtccccgac cacctcccgc tcctcctccg ccaccccgcc gccgggctcc 

accaccaccc gcgtgccgcc ggtgggctcc ggcaccgcca cctactccgg caacccgttc 

gtgggcgcga ccccgcgggc caacgcctac tacgcctccg aggcgtcctc cctcgccatc 

ccgtccccca ccggcgccat ggccaccgcc gccgccgccg tggccaaggt gccgtccttc 

atgcqgctcg acaccctcga caagaccccg ctcatggagc agaccctcgc cgacatccgc 

accgccaaca agaacggcgg caactacgcc ggccagttcg tggtgtacga cctcccggac 540 

cgcgactgcg ccgccctcgc ctccaacggc gagtactcca tcgccgacgg cggcgtggcc 

aagtacaaga actacatcga caccatccgc cagatcgtgg tggagtactc cgacatccgc 

accctccccq cgatcgagcc ggactccctc gccaacctcg tgaccaacct cggcaccccg 720 

aagtgcgcca acgcccagcc cgcctacctc gagtgcatca actacgccgt gacccagctc 

aacctcccqa acgtggccat gcacctcgac gccggccacg ccggctggct cggctggccg 

gccaaccacg acccggccgc ccagctcccc gccaacgtgt acaagaacqc ctcctccccg 

cgcgcccccc gcggcctcgc caccaacgtg gccaactaca acggctggaa catcacctcc 

ccgccgtccc acacccaggg caacgccgcg tacaacgaga agccctacat ccacgccatc 

ggcccgctcc tcgccaacca cggctggtcc aacgccttct ccatcaccga ccagggccgc 



180 
240 
300 
360 
420 
480 



600 
660 



780 
840 
900 
960 
1020 
1080 
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tccggcaagc 


agccgaccgg 


ccagcagcag 


tggggcgact 


ggtgcaacgt 


gatcggcacc 


1140 


ggct tcggca 


tccgcecgtc 


cgccaacacc 


ggcgactccc 


tcctcgactc 


ct t cgtgtgg 


1200 


gcgaagccgg 


gcggcgagcg 


cgacggcacc 


tccgactcct 


ccgccccgcg 


cttcgactcc 


1260 


cactgcgccc 


t cccggacgc 


cc tccagccg 


gccccgcagg 


ccggcgcctg 


gt t ccaggcc 


1320 


t act ccgtgc 


agctcct cac 


caacgccaac 


ccgtccttcc 


tccag 




1365 



<210> 95 

<211> 1317 

< 2 1 2 > DNA 

<213> Artificial Sequence 

<220> 

<223> Maize optimized EG LI 

<A00> 95 



acgcagcacc 


cgggcacctc 


caccccggag 


g tgcacccga 


agctcaccac 


ctacaagtgc 


60 


accaagcccg 


gcggctgcgt 


ggcccaggac 


acctccgtgg 


tgctcgactg 


gaactaccgc 


120 


tggatgcacg 


acgccaact a 


caactcctgc 


accgtgaacg 


gcggcgtgaa 


caccaccctc 


180 


z qcccggacg 


aggccacctg 


cggcaagaac 


tgct tcatcg 


agggcgtgga 


ctacgccgcc 


240 


tccggcgtga 


ccacctccgg 


ctcctccctc 


accatgaacc 


agtacatgee 


gtcctcctcc 


300 


ggcggctact 


cccccgtgtc 


cccgcgcctc 


tacctcctcg 


actccgacgg 


egagtaegtg 


360 


atgcccaagc 


z caacggcca 


ggagccctcc 


t tcgacgtgg 


acctctccgc 


cct cccgtgc 


420 


ggcgagaacg 


actccct cc a 

— 


cctct cccag 


a tggacgaga 


acggcggcgc 


caaccagtac 


480 


aacaccgccg 


gcgccaacta 


cggct ccggc 


tactgcgacg 


cccagtgccc 


ggtgcagacc 


540 


cggcgcaacc 


gcaccct caa 


cacctcccac 


cagggcttct 


getgeaaega 


gatggacatc 


600 


cccgagggca 


actcccgcgc 


caacgccctc 


accccgcact 


cctgcaccgc 


caccgcctgc 


660 


gac:ccgccc 


gctgcggccc 


caacccgtac 


ggct ccggct 


acaagtccta 


ctacggcccg 


720 


ggcgacaccg 


t ggacacctc 


caagacct tc 


accacca tea 


cccagttcaa 


caccgacaac 


780 


ggc:ccccgc 


ccggcaaccc 


cgtgtccatc 


acccgcaagt 


accagcagaa 


cggcgtggac 


840 


a tcccgt ccg 


cccagccggg 


cggcgacacc 


a tctcctcct 


gcccgtccgc 


ctccgcctac 


900 


ggcggccccg 


ccaccatggg 


caaggccctc 


ccctccggca 


tggtgctcgt 


gttctccatc 


960 


tggaacgaca 


actcccagta 


catgaactgg 


ctcgactccg 


gcaacgccgg 


cccgtgctcc 


1020 


cccaccgagg 


gcaacccgtc 


caacaccctc 


gccaacaacc 


cgaacaccca 


cgtggtgttc 


1080 


zccaacatcc 


gctggggcga 


catcggctcc 


accaccaact 


ccaccgcccc 


gccgccgccg 


1140 
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ccggcctcct ccaccacctt ctccaccacc cgccgctcct ccaccacctc ctcctccccg 1200 
tcctgcacec agacccaccg gggccagtgc ggcggcatcg gctactccgg ctgcaagacc 1260 
tgcacctccg gcaccacctg ccagtactcc aacgactact actcccagtg cctctag 1317 

<210> 96 

<211> HOI 

< 2 1 2 > DNA 

< 2 1 3 > Artificial Sequence 

<220> 

< 2 2 3 > Maize optimized BGLI I 

<4 00> 96 



atgctcccga 


aggact c cca 


qt ggggcttc 


gccaccgccg 


cctaccagat 


cgagggcgcc 


60 


gc eg a cca go 


aeggecgegg 


cccgtcca t c 


tgggacacct 


tctgcgccca 


geegggcaag 


120 


atcgccgacg 


gctcctccgg 


cgtgaccgcc 


tgcgactcct 


acaaccgcac 


cgccgaggac 


180 


at cgccc t cc 


tcaag tccct 


cggcgccaag 


tcctaccgct 


tctccatctc 


ctggtcccgc 


240 


a tea tecegg 


agggeggecg 


cggcgacgcc 


gt gaaccagg 


ccggcatcga 


ccactacgtg 


300 


aagt tcqtgg 


acgacc t cct 


cgacgccggc 


atcaccccgt 


tcatcaccct 


cttccactgg 


360 


gacctcccgg 


agggect cca 


ccagcgctac 


ggcggcctcc 


tcaaccgcac 


cgagttcccg 


420 


ctcgact teg 


agaactaege 


ccgcgtga tg 


t tccgcgccc 


tcccgaaggt 


gcgcaactgg 


480 


accaccctca 


acgagccgct 


ctgctccgcc 


atcccgggct 


acggctccgg 


caccttcgcc 


540 


ccgggccgcc 


agtccacctc 


cgagccgtgg 


accgtgggcc 


acaacatcct 


cgtggcccac 


600 


ggccgcgccg 


tgaaggecta 


ccgcgacgac 


ttcaagcegg 


cctccggcga 


cggccagatc 


660 


ggca t cgege 


teaaeggega 


cttcacctac 


ccgtgggacg 


ccgccgaccc 


ggccgacaag 


720 


gaggccgccg 


agcgccgcct 


cgagt tct tc 


accgcctggt 


tcgccgaccc 


gatctacctc 


780 


ggcgactacc 


cggcctcca t 


gcgcaagcag 


ctcggcgacc 


gcctcccgac 


cttcaccccg 


840 


gaggagegeg 


ccctcgtgca 


cggctccaac 


gact tctacg 


gca tgaacca 


ctacacctcc 


900 


aactacatcc 


gccaccgct c 


ctccccggcc 


t ccgccgacg 


acaccgtggg 


caacgtggac 


960 


gtgecett ca 


ccaacaagca 


gggcaact gc 


at cggcccgg 


agacccagtc 


cccgtggct c 


1020 


cgcccgt gcg 


ccgccggct t 


ccgcgact tc 


ctcgcgtgga 


tctccaagcg 


etaegget ac 


1080 


ccgccga tct 


aegt gaccga 


gaacggcacc 


t cca tcaagg 


gegagtcega 


cctcccgaag 


1140 


gagaaga tec 


tcgaggacga 


ct tecgegtg 


aagtactaca 


acgagtacat 


ccgcgccatg 


1200 


gtgacccccg 


tggagctcga 


eggegt gaac 


gtgaagggct 


acttcgcctg 


gtccctcatg 


1260 
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qacaaczzcq agtgggccga cggctacgtg acccgcttcg gcgtgaccta cgtggactac 1320 
gagaecggcc agaagcgctt cccgaagaag tccgccaagt cccccaagcc gctcttcgac 1380 
gacctcatcg ccgccgccta g 1401 

< 2 1 0 > 97 

<2\\> ?!03 

<2\2> DMA 

<213> Artificial Sequence 
<220> 

<r 22 3 > Maize optimized CEL3D 



<400> 97 
atgatcctcg 


gctgcgagtc 


caccqgcgtg 


at ctccgccg 


tgaagcact t 


cgtggccaac 


bu 


gaccaggagc 


acgagcgccg 


cgccgtggac 


tgcctcatca 


cccagcgcgc 


cct ccgcgag 


i 9n 


gtgtacctcc 


gcccgt tcca 


ga tcgtggcc 


cgcgacgccc 


gcccgggcgc 


cct ca t gacc 




tcctacaaca 


aggtgaacgg 


caagcacgtg 


gccgactccg 


ccgagt tcct 


ccagggca tc 




ctccgcaccg 


agtggaactg 


ggacccgctc 


a tcgtgtccg 


a ^m. mw i— _ mm m± *m—. J*m 

actggt acgg 


cacct aca cc 




acca tcgacg 


cca tcaaggc 


cggcct cgac 


ctcgagatgc 


cgggcgtgtc 


ccgct accgc 




g^caagt aca 


tcgagtccgc 


cct ccaggcc 


cgcct cctca 


agcagt ccac 


ca t cgacgag 




cgcgcccgcc 


gcgtgct ccg 


ct t cgcccag 


aaggcctccc 


acctcaaggt 


gt ccgaggtg 


' 4 q n 


cjagcagggcc 


gcgact t ccc 


ggaggaccgc 


gtgctcaacc 


gccaga tctg 


cggctcctcc 


540 


atcgcgctcc 


tcaagaacga 


gaactcca tc 


ctcccgctcc 


cgaagt ccgt 


gaagaaggtg 


600 


gccctcgtgg 


gctcccacgt 


gcgcct cccg 


gcca tctccg 


gcggcggctc 


cgcctccctc 


660 


gtgccgt act 


acgccatctc 


cctctacgac 


gccgtgtccg 


aggtgctcgc 


cggcgccacc 


720 


atcacccacg 


aggtgggcgc 


ct acgcccac 


caga t gctcc 


cggtgatcga 


cgccatga tc 


780 


tccaacgccg 


tgatccactt 


ctacaacgac 


ccga tcgacg 


tgaaggaccg 


caagctcctc 


840 


ggctccgaga 


acgtgtcctc 


cacct ccttc 


cagct catgg 


actacaacaa 


catcccgacc 


900 


ct caacaagg 


ccatgt tctg 


gggcaccct c 


gtgggcgagt 


tcatcccgac 


cgccaccggc 


960 


atctgggagc 


tcggcctctc 


cgtgttcggc 


accgccgacc 


tctacatcga 


caacgagctc 


1020 


gtgatcgaga 


acaccaccca 


ccagacccgc 


ggcaccgcct 


tcttcggcaa 


gggcaccacc 


1080 


gagaaggtgg 


ccacccgccg 


catggtggcc 


ggctccacct 


acaagctccg 


cctcgagttc 


1140 


ggct ccgcca 


acaccaccaa 


gatggagacc 


accggcgtgg 


tgaacttcgg 


cggcggcgcc 


1200 


gtgcacc.cg 


gcgcct gcct 


caaggtggac 


ccgcaggaga 


tgatcgcccg 


cgccgtgaag 


1260 
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gccgccgccg 


acgccgacta 


caeca tcatc 


tgcaccggcc 


tctccggcga 


gtgggagtcc 


1320 


gagggct teg 


accgcccgca 


catggacctc 


ccgccgggcg 


tggacaccat 


gatctcccag 


1380 


gtgctcgacg 


ccgccccgaa 


cgccgtggtg 


gtgaaccagt 


ccggcacccc 


ggtgaccatg 


1440 


c cctgggccc 


acaaggccaa 


ggeca tegtg 


caggcctggt 


aeggeggcaa 


cgagaccggc 


1500 


cacggcatct 


ccgacg tgct 


ct teggcaac 


gtgaacccgt 


ccggcaagct 


ct ccctctcc 


1560 


tggccggtgg 


acg tgaagca 


caacccggcc 


tacctcaact 


acgcctccgt 


gggcggccgc 


1620 


gt gctct acg 


gcgaggacg t 


gt acgtgggc 


tacaagttct 


acgacaagac 


egagegegag 


1680 


gtgctcttcc 


cgt t cggcca 


eggect ct cc 


tacgccacct 


tcaagctccc 


ggactccacc 


1740 


gtgcgcaccg 


tgccggagac 


ct tccacccg 


gaccagccga 


ccgtggccat 


cgtgaagatc 


1800 


aagaacacct 


cctccgtgcc 


gggcgcccag 


gtgctccagc 


tctacatctc 


cgccccgaac 


1860 


cccccgaccc 


accgcccggt 


gaaggagct c 


caegget teg 


agaaggtgta 


cctcgaggcc 


1920 


ggcgagqaga 


aggaggtgca 


gatcccgat c 


gaccagtacg 


ccacctcct t 


ctgggacgag 


1980 


a i cgagtcca 


tgt ggaagtc 


cgagcgcggc 


atctacgacg 


tgctcgtggg 


ct tctcctcc 


2040 


caggagatct 


ccggcaaggg 


caagctcatc 


gtgeeggaga 


cccgcttctg 


gatgggcetc 


2100 


tag 












2103 


< 2 1 0 > 98 
<2 1 1 > 420 
<212> DNA 
<213> Zea 


mays 












<220> 

< 2 2 3 > 0 protein promoter 










<400> 98 
gggctggt aa 


at tacc tggg 


agcaat ggta 


tgeaaatect 


ttgcatgt ac 


gcaaaactag 


60 


ccagt t gtca 


caagi t gtat 


atcga t tegt 


cgcgt ttcaa 


caactcatgc 


aaca t tacaa 


120 


acaag taaca 


caa ta 1 1 aca 


aagt tagtt t 


ca tacaaagc 


aagaaaagga 


caat aatact 


180 


tgacatgtaa 


agt gaagct t 


attatacttc 


ctaat ccaac 


acaaaacaaa 


aaaaagt tgc 


240 


acaaaggtcc 


aaaaatccac 


a tcaacca t t 


aacctatacg 


taaagtgagt 


ga tgagtcac 


300 


aizatccaac 


aaatgt t tat 


caat gtggta 


tcatacaagc 


at tgacatcc 


cataaatgea 


360 


agaaattgtg 


ccaacaaagc 


ta taagtaac 


cctca tatgt 


atttgeaetc 


a tgcatcaca 


420 



<210> 99 
< 2 1 1 > 1188 
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< 2 1 2 > DNA 

< 2 1 3 > artificial sequence 
<220> 

< 22 3> synthetic ferulic acid esterase 
<400> 99 



atggccgcct 


ccctcccgac 


ca tgccgccg 


tccggctacg 


accaggtgcg 


caaeggegtg 


60 


ccgcgcggcc 


aggtgg tgaa 


ca tctcctac 


t tctccaccg 


ccaccaactc 


cacccgcccg 


120 


gcccgcgt gt 


acctcccgcc 


gggctactcc 


aaggacaaga 


agtactccg t 


gctctacctc 


180 


ctccacggca 


tcggcggctc 


cgagaacgac 


tggttcgagg 


gcggcggccg 


cgccaacgtg 


240 


atcgccgaca 


acctca tege 


cgagggcaag 


atcaagccgc 


teatcategt 


gaccccgaac 


300 


accaacgccg 


ccggcccggg 


ca tcgccgac 


ggctacgaga 


act tcaccaa 


ggacctcctc 


360 


aactccctca 


t cccgt aca t 


cgagtccaac 


tactccgtgt 


acaccgaccg 


cgagcaccgc 


420 


gcca tcgccg 


gcccctct3 t 


gggeggegge 


cagtcct tea 


acatcggcct 


caccaacctc 


480 


gacaagttcg 


cctaca t egg 


cccga t ct cc 


gccgccccga 


acacctaccc 


gaacgagege 


540 


cccc tcccgg 


aeggeggcaa 


ggccgcccgc 


gagaagctca 


agctcct ctt 


catcgcctgc 


600 


ggcaccaacg 


actccc teat 


egget t egge 


cagcgcgtgc 


acgagtactg 


cgtggccaac 


660 


aacatcaacc 


acgtgt actg 


gctcatccag 


ggcggcggcc 


acgact tcaa 


cgtgtggaag 


720 


ccgggcctct 


ggaact tcct 


ccagatggcc 


gaegaggecg 


gcctcacccg 


cgacggcaac 


780 


accccggcgc 


cgaccccgt c 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


840 


gacggca tea 


octcctcctc 


catcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


at egget aca 


t caccc cegg 


cgact acctc 


gtgtacaagt 


ccatcgact t 


cggcaacggc 


960 


gccacctcct 


ccaaggccaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


1020 


aacggcccga 


acggcaccct 


catcggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 


1080 


acc tacgagg 


agcagacctg 


ctcca tctcc 


aaggtgaccg 


gcatcaacga 


cctctacctc 


1140 


gtgt tcaagg 


qcccggtgaa 


catcgact gg 


t tcaccttcg 


gcgtgtag 




1188 



<210> 100 

<21!> 395 

<212> PRT 

<2 1 3 > artificial sequence 
<220> 

<223> synthetic ferulic acid esterase 

<400> 100 
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Mec Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 
1 5 10 16 



Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser 

20 25 30 



Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 



Tyr Ser Lvs Asp Lys Lys Tvr Ser Val Leu Tyr Leu Leu His Gly He 
50 55 60 



0 1 y Glv Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
' 70 7 5 80 



Tie Ala Asp Asn Leu lie Ala Glu Gly Lys lie Lys Pro Leu He He 

8 5 90 95 



Val Thr Pre Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 

100 105 HO 



Giu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu 
115 120 125 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala lie Ala Gly 
130 J 135 140 



Leu Ser Mec Gly Gly Gly Gin Ser Phe Asn lie Gly Leu Thr Asn Leu 
145 1^0 155 160 



Asp Lys Fhe Ala Tyr He Gly Pro lie Ser Ala Ala Pro Asn Thr Tyr 

165 HO 175 



pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 

180 185 190 



ieu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 



Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn lie Asn His 
210 215 220 

Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 

225 230 235 240 

Page 116 



WO 2005/096804 PCT/US2004/007182 



Pro Glv Leu Trc Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 



Arg Aso Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 



Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie 
275 280 285 



Glu lie lie Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr lie 
290 * 295 300 



Thr Ser Glv Asd Tyr Leu Val Tyr Lys Ser lie Asp Phe Gly Asn Gly 
305 310 315 320 



Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 



Glu Leu Ara Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser 

340 ^ 345 350 



Vai Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 
355 ' 360 365 



lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val Phe Lys Gly 
370 ' 375 380 



Pro Val Asn lie Asp Trp Phe Thr Phe Gly Val 
385 390 395 



<210> 101 
<211> 1188 
< 2 i 2 > DMA 

<213> artificial sequence 
<220> 

<222> plasmid 13036 
<400> 101 

atqgccgcct ccctcccgac catgccgccg tccggctacg accaggtgcg caacggcgtg 
ccgcgcggcc aggtggtgaa catctcctac ttctccaccg ccaccaactc cacccgcccg 
gcccgcgtgt acctcccgcc gggctactcc aaggacaaga agtactccgt gctctacctc 



60 
120 
180 



ctccacggca ccggcggctc cgagaacgac tggttcgagg gcggcggccg cgccaacgtg 240 
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accgccgaca 


acctcatcgc 


cgagggcaag 


a tcaagccgc 


teatcategt 


gaccccgaac 


300 


accaacoccg 


ccggcccggg 


catcgccgac 


ggctacgaga 


acttcaccaa 


ggacctcctc 


360 


aactccctca 


icccgtacat 


cgagtccaac 


t actccgtgt 


acaccgaccg 


cgagcaccgc 


420 


qcca tcgccg 


gectet ctat 


gggeggegge 


cagtccttca 


acatcggcct 


caccaacctc 


480 


gacaagt ccg 


cctaca tegg 


cccgatctcc 


gccgccccga 


acacccaccc 


gaacgagege 


54 0 


ctct tcccgg 


aeggeggcaa 


ggccgcccgc 


gagaagctca 


agctcctctt 


caccgcctgc 


600 


ggcaccaacg 


actccctca t 


egget tegge 


cagcgcgtgc 


acgagtactg 


cgtggccaac 


660 


aaca tcaacc 


aegtg t actg 


gctca tccag 


ggcggcggcc 


acgact tcaa 


cgtgtggaag 


720 


ccgcgcc t ct 


ggaac c tccc 


ccagatggcc 


gaegaggecg 


gccccacccg 


cgacggcaac 


780 


accccggt gc 


cgaccccgtc 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


840 


gacggca tea 


actcctcctc 


ca tcgagatc 


atcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


a tcaactaca 


tcacctccgg 


cgac t acct c 


gtgtacaagt 


ccatcgact t 


cggcaacggc 


960 


gccaccccct 


teaaggecaa 


ggtggccaac 


gccaacacct 


ccaacatcga 


gcttcgcctc 


1020 


aaccgcccga 


acggcaccct 


ca ccggcacc 


ctctccgtga 


agtccaccgg 


cgactggaac 


1080 


acctacgagg 


agcagacctg 


ctcca tctcc 


aaggtgaccg 


gcaccaacga 


cctctacctc 


1140 


gtgt tcaagg 


gcccggtgaa 


caccgactgg 


t tcaccctcg 


gcgtgtag 




1188 



<210> 102 
< 2 1 1 > 395 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13036 
<400> 102 

Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 

10 15 



Arg Asn Glv Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser 

20 25 30 



Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 



Tvr Ser 
50 



Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly lie 

55 60 
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Glv Glv Ser Giu Asn Asd Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65* ' 70 75 80 



lie Ala Asp Asn Leu lie Ala Glu Gly Lys lie Lys Pro Leu lie lie 

85 90 95 



Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly lie Ala Asp Gly Tyr 

100 105 110 



Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu Tie Pro Tyr He Glu 
U5 120 125 



S e r asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly 
130 135 1^0 



Leu Ser Met Gly Glv Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 * ' 1 50 155 160 



Asd Lys Phe Ala Tyr He Gly Pro lie Ser Ala Ala Pro Asn Thr Tyr 

165 170 175 



Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 

130 185 190 

Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn He Asn His 

210 ' 215 220 



va' Tyr Trp Leu He Gin Gly Gly Glv His Asp Phe Asn Val Trp Lys 

225 230 235 240 

»ro G'y Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 



*rq Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 

Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser Ser Ser He 

275 280 285 
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Giu Tie lie Gly Val Pro Pro Giu Gly Gly Arg Gly He Gly Tyr He 
290 295 300 



Thr Ser Giv Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 310 315 320 



Ala Thr Ser Phe Lys Ala Lvs Val Ala Asn Ala Asn Thr Ser Asn lie 

325 330 335 



Giu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser 

340 345 350 



Val Lys Ser Thr Giy Asp Trp Asn Thr Tyr Giu Giu Gin Thr Cys Ser 
355 360 365 



lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val Phe Lys Gly 
370 ' 375 380 



Pro Val Asn lie Asp Trp Phe Thr Phe Gly Val 



jiS 390 395 



<2)0> 103 

< 2 i 1 > 12-15 

<2!2> ON A 

<?!3> artificial sequence 



<220> 

< 2 2 3 > plasmid 1 3038 



<.;00> 103 
atgaggqtgt 


tgctcgt tgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 


gccgcctccc 


tcccgacca t 


gccgccg tec 


ggc tacgacc 


aggtgcgcaa 


cggcgtgccg 


120 


cgcggcoagg 


eggtgaaca t 


ctcct act tc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


180 


cgcgtgtacc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


240 


cacggca f; eg 


gcggctccga 


gaacgacegg 


t tegagggeg 


gcggccgcgc 


caaegtgate 


300 


gccgacaacc 


tcaccgccga 


gggcaagatc 


aagccgctca 


teategtgae 


cccgaacacc 


360 


aacgccgccg 


gcccgggcac 


cgccgacggc 


t acgagaact 


tcaccaagga 


cctcctcaac 


420 


tccctcatcc 


cgtacaccga 


gtccaactac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


480 


atcgccggcc 


tccctatggg 


cggcggccag 


tccttcaaca 


tcggcctcac 


caacctcgac 


540 


aagc tegect 


aca tcggccc 


gatctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


600 


ttcccggacg 


geggcaagge 


cgcccgcgag 


aagctcaagc 


tcctcttca t 


cgcctgcggc 


660 
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accaacoact 


ccctcatcgg 


ct teggecag 


cgcgtgcacg 


agtactgcgt 


ggccaacaac 


720 


a tcaaccacg 


tgtactggct 


catccagggc 


ggcggccacg 


acttcaacgt 


gtggaagccg 


780 


ggcctctgga 


act t cctcca 


gatggecgae 


gaggccggcc 


tcacccgcga 


cggcaacacc 


840 


ccgg tgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tegaggcega 


ggactacgac 


900 


ggcat caact 

* 


ccccctccat 


cgaga t catc 


ggcgtgccgc 


eggagggegg 


ccgcggcatc 


'960 


ggc tacatca 


cctccggcga 


ctacctcgtg 


tacaagtcca 


tegacttegg 


caacggcgcc 


1020 


acctcct tea 


aggecaaggt 


ggeca aegee 


aacacctcca 


acatcgagct 


tcgcctcaac 


1080 


ggcccgaacg 


gcaccctca t 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


uacgaggagc 


agacct gctc 


catct ccaag 


gtgaceggea 


tcaacgacct 


ctacctcgtg 


1200 


c t caagggee 


ccg t gaaca t 


cgactggt cc 


acct teggeg 


tgtag 




1245 



<210> 104 
< 2 1 1 > 414 
<212> ?RT 

< 2 1 3 > artificial sequence 
<220> 

<223> plasmid 13038 aa 
<400> 104 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 



Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 



Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
35 40 4 5 



Tyr ?he Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 



Pro Pro Giy Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 



hh s Gly He Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 

- 85 90 95 



Ala Asn Val lie Ala Asp Asn Leu He Ala Glu Gly Lys lie Lys Pro 

100 105 110 
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Leu lie lie Val 
115 



Asp Gly Tyr Glu 
130 



Tyr lie Glu Ser 
145 



Tie Ala Gly Leu 



Thr Asn Leu Asp 

180 



Asn Thr Tyr Pro 
195 



Arg Glu Lys Leu 
210 



Leu lie Gly Phe 

225 



lie Asn His Val 



Val Tro Lys Pro 

260 



Gly Leu Thr Arg 
275 



Pro Ala Asn Thr 
290 



Ser Ser lie Glu 
305 



Gly Tyr lie Thr 



Thr Pro Asn Thr 

120 



Asn Phe Thr Lys 
135 



Asn Tyr Ser Val 
150 



Ser Met Gly Gly 
165 



Lys Phe Ala Tyr 



Asn Glu Arg Leu 

200 



Lys Leu Leu Phe 
215 



Gly Gin Arg Val 
230 



Tyr Trp Leu lie 
245 



Gly Leu Trp Asn 



Asp Gly Asn Thr 

280 



Arg He Glu Ala 
295 



He He Gly Val 
310 



Ser Gly Asp Tyr 
325 



Asn Ala Ala Gly 



Asp Leu Leu Asn 

140 



Tyr Thr Asp Arg 
155 



Gly Gin Ser Phe 

no 



He Gly Pro lie 
185 



Phe Pro Asp Gly 



Tie Ala Cys Gly 

220 



His Glu Tyr Cys 
235 



Gin Gly Gly Gly 
250 



Phe Leu Gin Met 

265 



Pro Val Pro Thr 



Glu Asp Tyr Asp 

300 



Pro Pro Glu Gly 
315 



Leu Val Tyr Lys 
330 



Pro Gly He Ala 
125 



Ser Leu He Pro 



Glu His Arg Ala 

160 



Asn He Gly Leu 
175 



Ser Ala Ala Pro 
190 



Gly Lys Ala Ala 
205 



Thr Asn Asp Ser 



Val Ala Asn Asn 

240 



His Asp Phe Asn 
255 



Ala Asp Glu Ala 

270 



Pro Ser Pro Lys 
285 



Gly lie Asn Ser 



Gly Arg Gly He 

320 



Ser He Asp Phe 
335 
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Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

Ser Asn lie Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly 
355 360 365 

Thr Leu Ser Val Lvs Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 
370 * 375 380 

Thr Cys Ser lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val 
385 390 395 400 

Phe Lys Gly Pro Val Asn lie Asp Trp Phe Thr Phe Gly Val 

4 05 4 10 

<2!0> 105 
< 2 1 1 > • 1425 
<212> DNA 

< 2 1 3 > artificial sequence 
<220> 

<223> plasmid 13039 



<400> 105 



a tgctggcgg 


ctctggccac 


gtcgcagctc 


gtcgcaacgc 


gcgccggcct 


gggcgtcccg 


60 


gacgcgtcca 


cgc tccgccg 


cggcgccgcg 


cagggectga 


ggggggcccg 


ggcgtcggcg 


120 


gcggcggaca 


cgctcagcac 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


180 


caggcgcgcc 


gcggggccag 


gttcccgtcg 


ctcgtcgtgt 


gcgccagcgc 


cggcgccatg 


240 


gccgcctccc 


ccccgaccat 


gccgccgtcc 


ggct acgacc 


aggtgcgcaa 


cggcgtgccg 


300 


cgcggccagg 


tggtgaacat 


ctcctacttc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


360 


cgcgtg tacc 


c cccgccggg 


ctact ccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


420 


cacggcatcg 


gegget ccga 


gaacgactgg 


t tegagggeg 


gcggccgcgc 


caaegtgat c 


480 


gccgacaacc 


ccatcgccga 


gggcaagatc 


aagccgctca 


teategtgae 


cccgaacacc 


540 


aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


600 


cccccca tec 


egtacatega 


atccaaccac 


tccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


660 


atcgccggcc 


tctctatggg 


cggcggccag 


tcct ccaaca 


tcggcctcac 


caacctcgac 


720 


aagt ccgccc 


acatcggccc 


gacccccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


780 


ttcccggacg 


geggcaagge 


cgcccgcgag 


aagctcaagc 


tcctcttcat 


cgcctgcggc 


840 


accaacgacu 


ccctca tegg 


cttcggccag 


cgcgtgcacg 


agtactgcgt 


ggccaacaac 


900 
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a tcaaccacg 


cgcactggcc 


cacccagggc 


ggcggccacg 


actt caacgt 


gtggaagccg 


960 


ggcctctgga 


actt cctcca 


gatggccgac 


gaagccggcc 


tcacccgcga 


cggcaacacc 


1020 


ccggcgccga 


ccccgccccc 


gaagccggcc 


aacacccgca 


tcgaggccga 


ggactacgac 


1080 


ggcatcaact 


cct cctccat 


cgaga tcatc 


ggcgtgccgc 


cggagggcgg 


ccgcggcatc 


1140 


ggctacat ca 


cctccggcga 


ctacctcgtg 


tacaagtcca 


tcgacttcgg 


caacggcgcc 


1200 


acctcct tea 


aggccaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1260 


ggcccgaacg 


gcacccccat 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1320 


cacgaggagc 


agacccgctc 


ca tctccaag 


gtgaccggca 


tcaacgacct 


ctacctcgtg 


1380 


t tcaaqggcc 


cog t gaaca t 


cgactggt t c 


accttcggcg 


tgtag 




1425 



<2 3 0> 106 

< 2 1 1 > 4 1 4 

<212> PRT 

<2i3> artificial sequence 
<22C> 

<22 3> plasmid 13039 aa 

<400> 106 

Me' Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
1 5 10 15 



Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 2 5 30 



Leu Arc Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 



Th*- Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 
50 55 60 

G»" Ala Arc Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala Met 

Sb ' 70 75 80 

Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val Arg 

85 90 95 



fisn dy Va> Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser Thr 

loo ' 105 no 
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Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro-Gly Tyr 
115 120 125 



Ser Lvs Aso Lvs Lvs Tyr Ser Val Leu Tyr Leu Leu His Gly lie Gly 
130 135 140 



Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val lie 
145 ISO 155 160 



Ala Asd Asn Leu lie Ala Glu Gly Lys lie Lys Pro Leu lie lie Val 

165 HO 175 



Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr Glu 

180 185 190 



Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu lie Pro Tyr He Glu Ser 
195 ' 200 205 



Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly Leu 
210 215 220 



Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu Asp 
225 ' J 230 235 240 



Lys Phe Ala Tyr He Gly Pro lie Ser Ala Ala Pro Asn Thr Tyr Pro 

245 250 255 



Asn Glu Arg Leu Phe Pro Asp Glv Gly Lys Ala Ala Arg Glu Lys Leu 

260 265 270 



Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu lie Gly Phe 
275 280 285 



Gly Gin Arg Val His Glu Tvr Cys Val Ala Asn Asn lie Asn His Val 
290 295 300 



Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys Pro 
305 310 315 320 



Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr Arg 

325 330 335 



Asp Glv Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn Thr 

340 345 350 
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Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser He Glu 
355 360 365 



lie lie Gly Val Pro Pro Glu Gly Gly Arg Gly lie Gly Tyr He Thr 
370 375 380 



Ser Gly Asp Tvr Leu Val Tyr Lys Ser lie Asp Phe Gly Asn Gly Ala 

335 390 395 400 

Thr ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn lie Glu 

4 05 410 415 



Le" Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser Val 

4 20 4 25 4 30 



Lys Ser Thr Glv Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser He 
4 35 4 40 44 5 



Ser Lvs Val Thr Gly lie Asn Asp Leu Tyr Leu Val Phe Lys Gly Pro 
4 50 4 55 4 60 



Val Asn lie Asp Trp Phe Thr Phe Gly Val 
465 470 



<210> 107 

<211> 1263 

<212> DMA 

<2!3> artificial sequence 
<220> 

<223> plasmid 13347 



acgagggtgc 


tgctcgt tgc 


cctcgctccc 


ctggctctcg 


ctgcgagcgc 


cacctccatg 


60 


gccgcc tccc 


icccgacca c 


qccgccgtcc 


ggctacgacc 


aggtgcgcaa 


cggcgtgccg 


120 


cgcggccagg 


tggtgaaca t 


ct cccact cc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


180 


cgcgtgtacc 


tcccgccggg 


ct actccaag 


gacaagaagt 


actccgtgct 


ctacctcctc 


240 


cacggcaccg 


gcggccccga 


gaacgactgg 


t tegagggeg 


gcggccgcgc 


caacgtga tc 


300 


qccgacaacc 


tea ccgccga 


gggcaagatc 


aagccgctca 


ccatcgtgac 


cccgaacacc 


360 


aacgccgccg 


gcccgggca c 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


420 


tccctcatcc 


cgt acaccga 


gcccaactac 


t ccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


480 
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a t cgccggcc 


tct ctatggg 


cggcggccag 


tcct tcaaca 


tcggcctcac 


caacctcgac 


540 


aagttcgcct 


acatcggccc 


ga tctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


600 


t tcccggacg 


geggcaagge 


cgcccgcgag 


aagctcaagc 


tcctcttcat 


cgcctgcggc 


660 


accaacgact 


ccctcaccgc 


ct t eggecag 


cgcgtgcacg 


agtactgcgt 


ggccaacaac 


720 


atcaaccacg 


tot actggct 


ca tccagggc 


ggcggccacg 


acttcaacgt 


gtggaagccg 


780 


ggcctctgga 


act tcctcca 


ga tggccqac 


gaggccggcc 


tcacccgcga 


cggcaacacc 


840 


cccgtgccga 


ccccgtcccc 


gaagccggcc 


aacacccgca 


tegaggcega 


ggactacgac 


900 


ggca t caact 


cctcctcca t 


cgaga tea t c 


ggcgtgccgc 


eggagggegg 


ccgcggca tc 


960 


ggccaca tea 


cctccggcga 


ctacctcgtg 


tacaagtcca 


tegacttegg 


caacggcgcc 


1020 


acctcct tea 


aggecaaggt 


ggccaacgcc 


aacacctcca 


acatcgagct 


tcgcctcaac 


1080 


ggcccgaacg 


gcaccctca c 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


r acgaggagc 


agacctgctc 


ca tct ccaag 


gtgaceggea 


t caacgacct 


ctacctcgtg 


1200 


c teaagggee 


eggtgaacat 


cgact ggt tc 


acct teggeg 


tgtccgagaa 


ggacgaactc 


1260 


tag 












1263 



< 2 1 0 > 108 
<2U> 420 

<2i2> par 

<213> artificial sequence 
<220> 

<223> plasmid 13347 
<400> 108 

Kef Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
,5 10 15 

Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 

Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser 
3^ 4 0 4 5 

Tyr ?he Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 

Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
6 c 70 75 80 
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His Glv He Glv 



Ala Asn Val lie 

100 



Leu lie lie Val 
115 



Asd Gly Tyr Glu 
130 



Tyr lie Glu Ser 
H5 



!le Ala Gly Leu 



Thr Asn Leu Asp 

180 



Asn Thr Tyr Pro 
195 



Arg Glu Lys Leu 
210 



Leu He Gly Phe 
225 



1 1 e Asn His Va 1 



Val Trp Lvs Pro 

260 



Gly Leu Thr Arq 
275 



Pro Ala Asn Thr 
290 



Gly Ser Glu Asn 
65 



Ala Asd Asn Leu 



Thr Pro Asn Thr 

.120 



Asn Phe Thr Lys 
135 



Asn Tyr Ser Val 
150 



Ser Met Gly Gly 
165 



Lys Phe Ala Tyr 



Asn Glu Arg Leu 

200 



Lvs Leu Leu Phe 
215 



Gly Gin Arg Val 
230 



Tvr Trp Leu lie 
245 



Gly Leu Trp Asn 



Asp Glv Asn Thr 

280 



Arg lie Glu Ala 
295 



Asp Trp Phe Glu 
90 



lie Ala Glu Gly 

105 



Asn Ala Ala Gly 



Asp Leu Leu Asn 

HO 



Tyr Thr Asp Arg 
155 



Gly Gin Ser Phe 
170 



lie Gly Pro lie 

185 



Phe Pro Asp Gly 



lie Ala Cys Gly 

220 



His Glu Tyr Cys 
235 



Gin Gly Gly Gly 
250 



Phe Leu Gin Met 

265 



Pro Val Pro Thr 



Glu Asp Tyr Asp 

300 



Gly Gly Gly Arg 
95 



Lys He Lys Pro 
110 



Pro Gly He Ala 
125 



Ser Leu He Pro 



Glu His Arg Ala 

160 



Asn He Gly Leu 
175 



Ser Ala Ala Pro 
190 



Gly Lys Ala Ala 
205 



Thr Asn Asp Ser 



Val Ala Asn Asn 

240 



His Asp Phe Asn 
255 



Ala Asp Glu Ala 
270 



Pro Ser Pro Lys 
285 



Gly He Asn Ser 
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q e r ^er He G>u He lie Gly Val Pro Pro Glu Gly Gly Arg Gly lie 

305 310 315 320 

Gly Tyr lie Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe 

325 330 335 

Gly Asn Gly Ala Thr Ser Phe Lvs Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

Ser Asn lie Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly 

355 360 365 

Thr Lpu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 

370 375 380 

Thr Os Ser He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val 

385 ' 390 395 400 



Phe L"S GW Pro Val Asn He Asp Trp Phe Thr Phe Gly Val Ser Glu 

.305 410 415 



Lys Asp Glu Leu 

4 20 



<210> 109 

<211> 1296 

< 2 1 2 > ON A 

<213> artificial sequence 
<220> 

<223> plasmid 11267 

<400> 109 



tgctcgt tgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


caccagcgct 


60 


agccggagct 


gaagctggag 


tccgtggtga 


tcgtgtcccg 


ccacggcgtg 


120 


ccaaggccac 


ccagctcatg 


caggacgtga 


ccccggacgc 


ctggccgacc 


180 


agcccggcga 


gctgaccccg 


cgcggcggcg 


agetgatege 


ctacctcggc 


240 


gccagcgccc 


cgtggccgac 


ggect cctcc 


egaagtgegg 


ctgcccgcag 


300 


t ggcca tea t 


cgccgacgtg 


gaegagegea 


cccgcaagac 


cggcgaggcc 


360 


gcctcgcccc 


ggactgcgcc 


atcaccgtgc 


acacccaggc 


cgacacctcc 


420 


egctet tcaa 


cccgctcaag 


accggcgtgt 


gccagctcga 


caacgccaac 


480 


ccatcctgga 


gcgcgccggc 


ggctccatcg 


ccgacttcac 


cggccactac 


540 
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cagaccgcct 


tccgcgagct 


ggagcgcgtg 


ctcaact tec 


cgcagtccaa 


cctctgcctc 


600 


aagcgcgaga 


agcaggacga 


gtcctgctcc 


ctcacccagg 


ccctcccgtc 


cgagctgaag 


660 


qtqtccqccq 


actgcgtgtc 


cctcaccggc 


gccgtgtccc 


tcgcctccat 


gctcaccgaa 


720 


at ct tec tec 


tccagcaggc 


ccagggcat g 


ccggagccgg 


gctggggccg 


catcaccgac 


780 


t cccaccagt 


ggaacaccct 


cctctccctc 


cacaacgccc 


agt tcgacct 


cctccagcgc 


84 0 


accccggagg 


tggcccgct c 


ccgcgccacc 


ccgctcctcg 


acctcatcaa 


gaccgccctc 


900 


accccgcacc 


cgccgcagaa 


gcaggcctac 


ggcgtgaccc 


tcccgacctc 


cgtgctct tc 


960 


atcgccggcc 


acgacaccaa 


cctcgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


1020 


ctcccgggcc 


ageeggacaa 


caccccgccg 


ggeggegage 


tggtgttcga 


gcgctggcgc 


1080 


cgcctctccg 


acaactccca 


gtggat tcag 


gtgtccctcg 


tgttccagac 


cctccagcag 


1140 


a tgegegaca 


agaccccgct 


ctccctcaac 


accccgccgg 


gcgaggtgaa 


gctcaccctc 


1200 


gccggctgcg 


aggagegcaa 


cgcccagggc 


a tgtgctccc 


tcgccggctt 


cacccaga tc 


1260 


gtcaacgagg 


cccgca tccc 


ggcctgctcc 


ctctaa 






1296 



<210> 110 

<211> 431 

< 2 1 2 > PRT 

<213> artificial sequence 
<220> 

< 2 2 3 > plasmid 1 1267 aa sequence 

<400> 110 

Met. Arq Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
1 5 10 15 

Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 

20 25 30 

Vai lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
35 40 45 

Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu Gly 
65 70 75 80 
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His Tyr Trp Arg 



Gly Cys Pro Gin 

100 



Arg Thr Arg Lys 
115 



Cys Ala He Thr 
130 



Leu Phe Asn Fro 
145 



Val T'nr Asp Ala 



Thr Gly His Tyr 



Phe Pro Gin Ser 
1 95 



Cys Ser Leu Thr 
210 



Cvs Val Ser Leu 
225 



lie Phe Leu Leu 



Arg He Thr Asp 

260 



Ala Gin Phe Asp 
275 



Ala Thr Pro Leu 
290 



Frc Gin Lys Gin 
305 



Gin Arg Leu Val 
85 



Ser Gly Gin Val 



Thr Gly Glu Ala 

120 



Val His Thr Gin 
135 



Leu Lys Thr Gly 
150 



lie Leu Glu Arg 
165 



Gin Thr Ala Phe 



Asn Leu Cys Leu 

200 



Gin Ala Leu Pro 
215 



Thr Gly Ala Val 
230 



Gin Gin Ala Gin 
245 



Ser His Gin Trp 



Leu Leu Gin Arg 

230 



Leu Asp Leu He 
295 



Ala Tyr Gly Val 
310 



Ala Asp Gly Leu 
90 



Ala He He Ala 
105 



Phe Ala Ala Gly 



Ala Asp Thr Ser 

140 



Val Cys Gin Leu 
155 



Ala Gly Gly Ser 
170 



Arg Glu Leu Glu 
185 



Lys Arg Glu Lys 



Ser Glu Leu Lys 

220 



Ser Leu Ala Ser 
235 



Gly Met Pro Glu 
250 



Asn Thr Leu Leu 

265 



Thr Pro Glu Val 



Lys Thr Ala Leu 

300 



Thr Leu Pro Thr 
315 
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95 



Asp Val Asp Glu 
110 



Leu Ala Pro Asp 
125 



Ser Pro Asp Pro 



Asp Asn Ala Asn 

160 



He Ala Asp Phe 
175 



Arg Val Leu Asn 
190 



Gin Asp Glu Ser 
205 



Val Ser Ala Asp 



Met Leu Thr Glu 

240 



Pro Gly Trp Gly 
255 



Ser Leu His Asn 
270 



Ala Arg Ser Arg 
285 



Thr Pro His Pro 



Ser Val Leu Phe 

320 
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lie Ala Glv His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 



Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 



Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 



Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
3?0 375 330 



Th- Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 

3S5 390 395 400 

Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

,305 410 4 15 



Phe Thr Gin Tie Val Asn Glu Ala Arg lie Pro Ala Cys Ser Leu 

420 425 430 



<210> 
<211> 
<212> 
<2!3> 


1 1 1 

1314 

DHA 

artificial sequence 










< 220> 
<223> 


plasttid 1)269 












<400> Hi 
a tgaggg t g t 


tgctcgt cgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


caccagcgct 


60 


gcgcag 


tccg 


agccggagct 


gaagctggag 


tccgtggtga 


tcgtgtcccg 


ccacggcgtg 


120 


cgcgcc 


ccga 


ccaaggccac 


ccagctcatg 


caggacgtga 


ccccggacgc 


ctggccgacc 


180 


•igcccggtga 


agct cggcga 


gctgaccccg 


cgcggcggcg 


agetgatege 


ctacctcggc 


240 


cact actggc 


gccagcgcct 


cgtggccgac 


ggcct cctcc 


egaagtgegg 


ctgcccgcag 


300 


cccggccagg 


tggccatca t 


cgccgacgtg 


.gacgagcgca 


cccgcaagac 


cggcgaggcc 


360 


t tcgccgccg 


gcctcgcccc 


ggactgcgcc 


a tcaccgtgc 


acacccaggc 


cgacacctcc 


420 


tccccggacc 


cgctcttcaa 


cccgctcaag 


accggcgtgt 


gccagctcga 


caacgccaac 


480 


gtgaccgacg 


cca tcctgga 


gcgcgccggc 


ggctcca teg 


ccgact tcac 


cggccactac 


540 


cagacr 


gccc 


tccgcgagct 


ggagcgcgtg 


ctcaact tec 


cgcagtccaa 


cctctgcctc 


600 
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aaocgccaga 


agcaggacga 


gtcctgctcc 


ctcacccagg 


ccctcccgtc 


cgagctgaag 


660 


gcgtccgccg 


actgcgtgtc 


cctcaccggc 


gccgtgtccc 


tcgcctccat 


gctcaccgaa 


720 


atct tccccc 


t ccagcaggc 


ccagggcatg 


ccggagccgg 


gctggggccg 


catcaccgac 


780 


ccccaccagt 


ggaacaccct 


cctctccctc 


cacaacgccc 


agttcgacct 


cctccagcgc 


840 


accccggagg 


cggcccgctc 


ccgcgccacc 


ccgctcctcg 


acctcatcaa 


gaccgccctc 


900 


accccgcacc 


cgccgcagaa 


gcaggcctac 


ggcgtgaccc 


tcccgacctc 


cgtgctct tc 


960 


atcgccggcc 


acgacaccaa 


cct cgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


1020 


ctcccgggcc 


agccggacaa 


caccccgccg 


ggcggcgagc 


tggtgttcga 


gcgctggcgc 


1080 


cgcc tctccg 


acaact ccca 


gtggattcag 


gtgtccctcg 


tgttccagac 


cctccagcag 


1140 


a tgcgcgaca 


agaccccgct 


ct ccctcaac 


accccgccgg 


gcgaggtgaa 


get caccctc 


1200 


gccggct gcg 


aggagcgcaa 


cgcccagggc 


a tgtgctccc 


tcgccggctt 


cacccagatc 


1260 


gtqaacgagg 


cccgca tccc 


ggcctgct cc 


ctctccgaga 


aggacgagct 


gtaa 


1314 



<210> 112 

< 2 1 1 > 4 37 

<212> PRT 

<213> artificial sequence 



<220> 
<223> 



plasmid 11268 amino acid sequence 



<400> 



112 



Mec Arg Val 



Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
5 10 15 



Ala Thr Ser 



A^a Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 
20 25 30 



Val lie Val 



Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 




Leu Met Gin 
50 



Asp 



Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
55 60 



Leu Gly Glu 



Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu Gly 
70 "?5 80 



His Tyr Trp 



Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 
85 90 95 
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Glv Cys Pro Gin Ser Gly Gin Val Ala lie lie Ala Asp Val Asp Glu 

100 105 110 



Arc Thr Arg Lvs Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
115 ' 120 125 

Cys Ala lie Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 1^0 

Ipu phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 

145 150 155 160 

Val Thr Asp Ala lie Leu Glu Arg Ala Gly Gly Ser He Ala Asp Phe 

.165 ' 170 175 



T^r Gl" His Tvr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 

180 185 190 



Ph* "ro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
195 200 205 

Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
* 210 215 220 

Cys Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 

Pe Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

2 45 250 255 



Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 
260 265 270 



A • a Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 

Ala Thr Pro Leu Leu Asd Leu lie Lys Thr Ala Leu Thr Pro His Pro 
290 * 295 300 

*ro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 

305 310 315 320 
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lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 



Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 

Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 

lie Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 3"S 380 

Th- P-o Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
365 * 390 395 400 

h\a ~lv Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

.305 410 415 

PhP Thr Gin lie Val Asn Glu Ala Arg lie Pro Ala Cys Ser Leu Ser 

420 425 430 



Glu Lys Asp Glu Leu 
435 
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